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《华罗庚文集》编委会 


王元万哲先陆启铿杨乐 
李福安贾朝华尚在久周向宇 



《华罗庚文集》序言 

2010年是著名数学家华罗庚先生诞辰100周年.值此机会，我们编辑出版《华 
罗庚文 集》， 作为对他的美好纪念. 

华罗庚先生是他那个时代的国际领袖数学家之一，也是中国现代数学的主要奠 
基人和领导者.无论是在和平建设时期，还是在政治动荡甚至是战争年代，他都抱 
定了为国家和人民服务的宗旨，为中国数学的发展倾注了毕生精力，受到了中国人 
民的广泛尊敬. 

华罗庚先生最初研究数论，后将研究兴趣拓展至代数和多复变等多个领域，取 
得了一系列国际一流的成果，引领了这些领域的学术发展,产生了广泛持久的影响. 
他从一名自学靑年成长为著名数学家，其传奇经历激励了几代中国数学家投身于数 
学事业. 

华罗庚先生为我们留下了丰富的精神遗产，包括大量的学术著作和研究论文 • 
我们认为,认真研读这些著作和论文，是深刻把握华罗庚学术思想精細的最佳途径 • 
无论对于数学工作者还是音年学生,其屮许多内容都是很有启发和裨益的. 

华罗庚先生担任中国科学院数学研究所所长30余年，他言传身教，培养和影 
响了一批国际水平的数学家，他的学术思想和治学精神 d 经成为数学所文化的核 
心.自2008年起以中国科学院数学研究所为基础成立的中国科学院华罗庚数学重 
点实验室，旨在继承和弘扬华罗庚先生的学术思想和治学精神，积极推动中国数学 
的发展.为此，我们选 辑华罗 庚先生的著作和论文作为实验室的首批出版物，今后 
还将陆续推出更多优秀的数学出版物. 

在出版《华罗庚文集》的过程中，我们得到了各方面的关心和支持，包括国家 
出版基金的资助，在此我们表示深深的 感谢. 同时，对于有关人员在策划、翻译 
和审校等方面付出的辛勤劳动,对于科学出版社所作的大量工作，我们表示诚挚的 
谢意_ 


中国科学院华罗庚数学重点实验室 
《华 罗庚文集》编委会 
2010 年 3 月 
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Algebra and geometry * 


Z. X. Wan 

Finite Groups. Early in 1938, while Hua taught at the Southwest Association 


the topics studied were p-groups. In [46, 81], he introduced the concept of the rank 
of a p-group. A p-group 0 of order p n is said to be of rank a, if the maximum of the 
orders of its elements is p n ~ a . Using this concept he proved that a pseudo-basis exists 
in p-groups, i.e., if p > 3 and n ^ 2a + 1, then every element of g can be expressed 
uniquely as 

G = A s A s 0 rA i a a _- 1 l - A s 1 I , 0^S^p n ~ a ~ 1, 0< 

where A is of order p n ~ a and = 1. With the aid of pseudo-bases, and of a modified 
form of the enumeration principle of P. Hall, he proved several “Anzahl” theorems. 
For instance, if g is a group of order p n and rank a (p > 3, n ^ 2a 4-1), then (i)fl 
contains one and only one subgroup of order p m and rank a (2a + 1 《 m 《 n); 
(ii) 0 contains p a cyclic subgroups of order p m (a < m < n - a - 1); (iii) the number 
of elements of order < (a 彡 m 彡 n - a) in 0 is equal to p m+a . The second and 
third results improved theorems of G. A. Miller and A. A. Kulakoff respectively. 

Skew Fields. Since Hamilton's first example of a non-commutative division 

algebra - the quaternion algebra - division algebras have received a great deal 

of attention. By comparison, infinite dimensional division algebras - skew fields 

— —were neglected; until, around 1950, with his perceptive direct algebraic method 
Hua proved several remarkable theorems in this area. 

First, in 1949’ Hua^ 88 ' proved that “every semi-automorphism of a skew field 
is either an automorphism or an anti-automorphism” (by a semi-automorphism of 
a skew field we mean a one-to-one mapping a from the skew field into itself with 
the properties (a + b)° — a° + b°, (aba)° — a°b a a° and 1° = 1). This theorem 
was referred to as the beautiful theorem of Hua by E. Artin in his book Geometric 

* Reprinted from Loo-Keng Hua Selected Papers. New York: Springer-Verlag, 1983: 281-284. 
The references in this article are those in this book. 






华罗庚文集代数卷 II 


Algebra. From it HimJ 88 則 deduced also the fundamental theorem of 1-dimensional 
projective geometry over a skew field. In 1950【 97 1 he extended his theorem to semi- 
homomorphisms of rings without zero divisors. 

Secondly, in 1949 L. K. Hua 【 91 】 gave a straightforward proof that “every proper 
normal subfield of a skew field is contained in its center” • This result appears in the 
literature as the Cartan-Brauer-Hua theorem. Before the work of Hua and Richard 
Brauer, Henri Cartan's proof had used the complicated device of Galois extensions 
over subfields. By contrast, Hua's proof requires only the elementary identity: If 
ab _ ba, then 

a= (&-i -(a- m** - 1)) (a-Va -(a - ma - 1)) _ ' 

In 1950, Huai 96 ! proved also that "Hf a skew field is not a field, then its multi¬ 
plicative group is not meta-abelian”. 

Classical Groups. Early in 1946, L. K. Hua 【 73 】 published his first paper on 
automorphisms of classical groups, in which he determined the automorphisms of 
a real symplectic group. Subsequently, in 1948, hel 85 ! determined the automorphisms 
of a symplectic group over any field of characteristic not 2. The method of Hua for 
determining the automorphisms of symplectic groups can be applied also to classical 
groups of other types; but since Dieudonn^ published his results on the automorphisms 
of classical groups in 1951, Hua* 101 ' restricted himself to publishing only solutions, by 
his own method, to a series of problems left open by Dieudonn4. The first of these 
was the determination of automorphisms of GL2(/f), K being an arbitrary skew field 
of characteristic ^ 2. 

Besides GL2(K), Hua【 101 l determined also the automorphisms of SL4(/f) and 
PSL>4(/^), where K ia & skew field of characteristic not 2, and the automorphism of 
Oj (K, /), where K is a. field of characteristic not 2 and / is a quadratic form of index 
2. Afterwards, Hua and Z. X. Wan【 105 I determined the automorphisms of SL2(/0 
and PSLa(if), where is a skew field of characteristic 〆0, the automorphisms of 
SL4(/f) and PSL4(/f), where 尺 is a skew field of characteristic 2, and they proved 
also the nonisomorphism of certain linear groups. 

Hua’s work on the automorphisms of classical groups, shows mastery of the 
techniques of matrix calculation. The procedure is to start with the low-dimensional 
cases and to proceed to the higher-dimensional cases by induction, as in [85], for 
SP 2 n(K). 

About the structure of classical groups, Hua extended the usual unitary group to 



the case when the basic field is not necessarily commutative but has an involutive anti¬ 

automorphism. He proved that the group TU n {K\S) generated by unitary transvec- 
tions modulo its center is a simple group, if S has index ^ 1 and that TU n (KiS) is 
the commutator subgroup of U n {K\S), if the index of S satisfies n ^ 2V ^ 4. 

Hua and I. Reiner* 102,106 ) also determined the automorphisms of GL n (Z) and 
DGL n (Z), which was the start of the work on the automorphisms of classical groups 
over rings. They 【 92 】 also proved that GL n (Z) is generated by three elements, SL n (Z) 
by two elements, and Sp2n(Z) by four elements for n > 2. Formerly Poincare had 
stated without proof that Sp2 n (Z) is generated by elementary matrices of two simple 
types, and later Brahana^ had proved this by showing that every element of Sp2n(Z) 
is expressible as a product of matrices taken from some finite set of matrices. 

Geometry of Matrice^ 67J6 ~ 78 ' 93,99 ^. Study of this topic was initiated by Hua and 
relates to Siegel's work on fractional linear transformations. In it, the points of the 
space are matrices of a certain kind, for instance, rectangular matrices, symmetric 
matrices or skew-symmetric matrices of the same size. There is then a group of 
motions in this space, and the problem is to characterize the group of motions by 
as few geometric invariants as possible. First, he studied the geometry of matrices 
of various types over the complex or real fields. Later, he extended his results to 
the case when the basic field is not necessarily commutative and discovered that 
the invariant “coherence” is alone sufficient to characterize the group of motions of 

the apace. Take his paper [99] as an example. He proved the fundamental theorem 

of affine geometry of rectangular matrices: Let 1 < n ^ m. Then the one-to-one 

mappings from the set of n x m matrices over a skew field K onto itself preserving 
coherence {two matrices M and N are said to be coherent, if the rank of M—N is 1) is 
necessarily of the form 

Z x = PZ°Q + R, (1) 

where P = P( n ) and Q = Q( m ) are invertible matrices, R is an wxm matrix, and cr 
is an automorphism of K\ if n = m, then besides (1) we have also 

Zi = PZ n Q + R, 

where t is an anti-automoTphisTn of K. From this theorem he deduced the fundamental 
theorem of the projective geometry of rectangular matrices (the Grassmann space), 


* Poincar6 H. Rend Circ Mat Palermo, 1904, 18: 45-110. 
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and he determined the Jordan isomorphism of total matrix rings over skew fields < 
characteristic 一 2 and the Lie isomorphism of total matrix rings over skew fields < 
characteristic 一 2, 3. 













苏家驹之代数的五次方程式解法不能成立之理由* 

五次方程式经 Abel, Galois 之证明后，一般算学者均认为不可以代数解矣，而 
《学艺》七卷十号载有苏君之“代数的五次方程式之解法” 一文，罗欣读之而研究 
之,于去年冬亦仿得“代数的六次方程式之解法”矣.罗对此欣喜异常,意为果能成 
立,则于算学史中亦可占一席地也，惟自思若不将 Abel 言论驳倒,终不能完全此种 
理论，故罗沉思于 Abel 之论中，凡一阅月，见其条例精严,无懈可击，后经本社编辑 
员之暗示，遂从事于苏君解法确否之工作，于六月中遂得其不能成立之理由，罗安 
敢自秘，特公之于世，尚祈示正焉. 


解法简述 


用 Sylvester 之分离消去法 (diabylic method of elemination) 将普通形 


X 5 + PlX 4 + P 2® 3 + P3X 2 + P\X + p 5 = 0 

化为可解形 X 6 + P x X 4 + P 2 X 3 + P 3 X 2 + P 4 X + P 6 = 0 (中有巧 =0, P 3 = 0, P 2 2 = 
5/*4)，而 ■有 = + + n 2 x 2 + n 3 i 3 + n 4 x 4 之关系. Pi , 巧，巧，尸4,尸 5 为 

no , ni , n 2, n 3, n 4 之一次、二次、三次、四次、 五次齐次函数 .忾 = 0,即 n 0 可以 
ni , n 2，《3， n 4 之一齐次函数表之，以之代入 P 2 yPz , Pi , P ^, 则得 ni , n 2, n 3) n 4 之二、 
三、四、五次齐次函数，而巧之一般形可写为 

Ain? + A 7 .ni + i4 3 n| + >Un| + Asnfn 2 + A^n\nz + A7n\rn 4 - A 6 nlni + ■A 9 n!rt 3 
+ >lion2 n 4 + -^11«3«8 + ^12«3«2 + + Awnjni + ^15714712 + Aien^na 

+^17«1^2«3 + ^18«1«2«4 + ^19«1«2«4 + (I) 


式中 Ai,...，i4ao 为 pi,• • ■ 1P5 之函数为己知者 • 

若令等于下式 

(oini + 02 « 2 )( 03«1 4- «4«2 + Os* 1 ! + °6«4 + «7«1«2 + 08^1^3 + 09^1^4 + 010«2«3 
+011«2«4 + 012«3«4) + (at3«3 + 014^4) (ai5«l + + Ol7«3 + °18«4 + 019^1^2 

+a20 n l n 3 + 021^*1^4 + <*22打2卩3 + 023^2^4 + 024«3 n 4)i (II) 


式中<*1， …， (》24为未定系数. 


* 科学， 1930, 15: 307-309. 
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再设 ai«i + a2ti2 = 0, oi3«3 + ai4«4 = 0,代入/§ = 5 尸 4式中，则此式为叱,《4 
之四次齐次函数，解之，则得 n 2 ,n 4 之比值，由此可作得 n 0 ：m：n 2 ：n 3 : n 4 之值, 
故普通形可化为上之可解形，换言之，即五次方程式可得而解矣. 

谬误点 

罗研究上意知其谬误在巧中，即⑴不能等于（II)也.夫求未定系数 O !,--, 
a 24 , 原文亦有求之之二十方程式，罗为便利讨探计,特分之为四类,转录于下： 

(― )aia 3 = Ai, a 2 a4 = A 2 , 

03O2 + OiQ7 = ^5i aja! + 0207 = ^8 ； 

(—) 013017 = Az, auai8 = Al, 

017014 + 013024 = 乂13， ai8<»13 + ai 4 024 = A\t\ 

(三） 013018 + Ol<»3 = 义6， 014015 + aia» = Ay, 

aiau + «2«9 = Oi8 — autti9> «2an + ai4aie = ^4i0i 

0-20,10 + (213Q16 = ^9> a!aio + 0208 = <*17 ~ Ol3fll9 ； 

(四） aia6 + oisajo = An, + 013022 = A\^, 

02012 + ai4022 = j4l9 — 0l3023i + O13O2I = Mo — 014^20) 

aiOg + 014021 = i4u, 0206 + 014023 = •Ajs. 

依原所谓假 Or, 024 贝! I 由( _ )，（二）得 之值，则第 

二类乃为 a8，ai5,a9，aii,ai6,<Mo 之联立一次方程式（设 aig 为已知)，以行列式解之， 
知其各分母悉为 

ai a]3 0 0 0 0 

0 ai4 ai 0 0 0 

0 0 02 aj 0 0 

A = 

0 0 0 a2 ai4 0 

0 0 0 0 aj3 a2 

ai 0 0 0 0 ai 

然 

ai oi3 0 0 0 0 

0 ai4 ai 0 0 0 

0 0 a2 ai 0 0 

0 0 0 02 ai4 0 

0 0 0 0 ai3 a2 

02 0 0 0 0 oi 


au ai 0 0 0 

0 a2 ai 0 0 

0 0 <22 ai4 0 

0 0 0 ai3 02 

0 0 0 0 ai 



苏家驹之代数的五次方程式解法不能成立之理由 


ai 0 
<12 ai 

0 Q2 
0 0 
0 0 


0 0 

0 0 

a 14 0 

<*13 °2 

0 ai 


而 08. Ol5) 09* Oil) Ol«» OlO = ^/A. 

因 △ = 0 , 故 ag , axs , 09 , au , aie ,< tio 非不定即无限大，故 （ I ) 等 （ II ) 之谬论不 
攻自破矣.换言之，即 ft 为零不能解得二一次式，故此法亦不能解五次方程 式也. 



Geometries of matrices. I. Generalizations of von 
Staudt’s theorem* 

It was first shown in the author's recent investigations on the theory of auto- 
morphic functions of a matrix-variable that there are three types of geometry playing 
important roles. Besides their applications, the author obtained a great many results 
which seem to be interesting in themselves. 

The main object of the paper is to generalize a theorem due to von Staudt, which 
is known as the fundamental theorem of the geometry in the complex domain. The 
statement of the theorem is: 

Every topological transformation of the complex plane into itself, which leaves 
the relation of harmonic separation invariant, is either a collineation or an anti- 
collineation. 

Since the fields and groups may be varied, several generalizations of von 
Staudt's theorem will be given. The proofe of the theorems have interesting 
corollaries. 

The paper contains also some fundamental results which will be useful in suc¬ 
ceeding papers. 

The interest of the paper seems to be not only geometric but also algebraic, for 
example we shall establish the following purely algebraic theorem: 

Let iNl be the module formed by n-rowed symmetric matrices over the complex 
field. Let T be a continuous (additive) automorphism ofTl leaving the rank unaltered 
and r(iX) = Then r m on inner automorphism of 9Jt, that is, we have a 

nonsingular matrix T such that 

r(X) = TXT. 

The author makes the paper self-contained in the sense that no knowledge of the 
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I. Geometry of symmetric matrices 

Let 屯 be any field. In I, II, and III, capital Latin letters denote n x n matrices 
unless the contrary is stated. But on the contrary, we use to denote an 

n x m matrix, and M( n ) = M( n,n ). J and 0 denote the identity and zero matrices 
respectively. 

Throughout I, we use 



which are 2n-rowed matrices. 

1. Definitions 

We make the following definitions. 

A pair of matrices {Z\, Z 2 ) is said to be symmetric if 

(Z 1 ,Z 3 )df(Z 1 ,Z 3 y = 0, 

that is, if Z\Z' 2 = Z 2 Z[. The pair is said to be nonaingular if (ZiyZ^) is of rank n. 
A 2n x 2n matrix T is said to be symplectic if 

嫩 = ff . 

Explicitly, let 

% 

then we have 

AB' = BA\ CD' = DC\ AD' - BC' = I. 

Further, it may be easily verified that 

H-W) 

is also symplectic. 

We define 

{W u W 2 ) = Q{Z u Z 2 ) , l 

to be a symplectic transformation, where Q is nonsingular and T is symplectic. 
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Since 

W 2 ) f = Q(Z 1 ,Z 2 )%^ , (Z 1 ,Z 2 )'Q', 

a symplectic transformation carries symmetric (nonsingular) pairs into symmetric 
(nonsingular) pairs. 

We identify two nonsingular symmetric pairs of matrices (Zi,Z 2 ) and (Wi, W 2 ) 
by means of the relation 

(Z 1 ,Z 2 ) = Q(W U W 2 ). 

It is called a point of the space. The space so defined is unaltered under symplectic 
transformations, which may be considered as the motions of the space. 

If Z\ and W\ are both nonsingular and if (JVi, W 2 ) = Q(Zi,Z2)1, let 

W = z = -z^z 2 , 

then W and Z are both symmetric and 

Z = (AW + B)(CW + D)~ l . 

Thus a symmetric pair of matrices may be considered as homogeneous coordinates 
of a symmetric matrix. The terminology “geometry of symmetric matrices’，is thus 
justified. 

2. Equivalence of points 

Theorem 1 Any two nonsingular symmetric pairs of matrices are equivalent. 
Or what is the same thing ： every nonsingular symmetric pair is equivalent to {1,0). 
Proof Let (Zi, Z 2 ) be a nonsingular symmetric pair. 

(1) If Z\ is nonsingular, we have 

(Z u Z 2 ) = Z 1 {I,Z^Z 2 ) = Z 1 {I,0) 

where S — Z^Z 2 is symmetric, and then 

I S 

0 1 

is symplectic. 

(2) Suppose Z\ to be singular. We have nonsingular matrices P and Q such that 

( J(f) Q(r,r»-r) \ 

0( 一 o (-) ) 
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(.W 1 ,W 2 ) = P(Z 1 ,Z 2 ) 


Q o 
, 0 Q '- 1 . 


Since 


W 2 = pz 2 q'~ 


( s ⑺ m( r - n - r > ’ 

^ g(n-r,r) j(n-r) 


^ 0 ) 

is symplectic, {W\, W 2 ) is nonsingular and symmetric. Consequently a is symmetric 
and 9 is a zero matrix. 

Let 

{UuU2) = {W u W 2 )( V 


Then 


8< r > 0 
0 /( n_r > 


Ui = Wi, U 2 = -W 1 S + W 2 
Since (Ui,U 2 ) is nonsingular, *( n_r ) is nonsingular. Let 
(Yi,V 2 ) = {UuU i )( I T M 


0 m 
0 t , 


then 

( J< r ) 

Vi = U x + U 2 =\ Q 

which is nonsingular. By (1), we have the theorem. 

3. Equivalence of point-pairs 

Definition Let (Zi, Z 2 ) and {Wi,Wi) be two nonsingular symmetric pairs of 
matrices. We define the rank of 



W 2 )' = ZxWi - Z 2 W[ 
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to be the arithmetic distance between the two points represented. Evidently, the 
notion is independent of the choice of representation. Further, it is invariant under 
symplectic transformations. In fact, let 

(Z ： ,Z；) = Q(Z lt Z 2 )% = R{W u W 2 )% 

then 


W^y = Q(Z U Z 2 )1^(W U W 2 )'R ； = 

In nonhomogeneous coordinates, the arithmetic distance between two symmetric 
matrices W, Z is equal to the rank of W — Z. 

Theorem 2 Two point-pairs are equivalent if and only if they have the same 
arithmetic distance. What ia the same thing: every point-pair with arithmetic distance 
r is equivalent to 

(1,0), (IJ r ), 


J( r ) 0 


Proof By Theorem 1, we may assume that the point-pairs are of the form 
(/,0), {Z U Z 2 ). 


The arithmetic distance being r, it follows that Zi is of rank r. We have two 
nonsingular matrices P and Q such that 


QZ 2 P 


/<*■) 

0 


0 

0 


Ir- 


Then 

Q(Z 1 ,Z 2 ) ^ P， q 1 ° p j =(T,/ r ) 

and 

Q (/，0)( 尸二) 

Since (T, / r ) is a nonsingular symmetric pair, we have, consequently, 

: r-K) f 1 

( o p(— J ’ 



where s is symmetric and p is nonsingular. Then 


’/(••) _t p -i ' 
0 p- 1 


(T,I r ) 


(( 


0 

0 J( n — r > 


r | 


((T A), 


l 

I-s 0 、 


'-(Ur) 


(1,0) 


I 

I-s 0 ' 
0 0 


=(/,0). 


V V 0 0 , 

is symplectic, we have the result. 

Definition The points (Xi,X 2 ) with singular X\ are called points at infinity 
(or symmetric matrices at infinity). Finite points are those with nonsingular X\. 

Lemma Any finite number of points may be carried simultaneously into finite 
points by a symplectic transformation, if 9 is the field of complex numbers. 

Proof (1) Given any symmetric pair of matrices (TijTa), we have a symplectic 
matrix 

V Ti T 2 ) 

In fact, by Theorem 2, we have a symplectic % such that 

m ， T 2 ) = Q(-/,0)T. 




Then 

(Pi P 2 N 
\ T i > 

which is evidently symplectic. 
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(2) For a fixed point the manifold 

det((A- 1 ,X 2 )ff(Z 1 ,Z 2 ) , ) = 0 
is of dimension n(n + 1) — 2. Let 

(义1，义2)，… I / j ) 

be p given points. Then we have p manifolds 

det (( AuAMZi , Z 2 Y ) = 0,…， detKL !， Z 2 )’）= 0. 

In the space, there is a point (Ti,T 2) which is not on any one of the manifolds. The 
transformation 

(Y 1 ,Y i ) = Q(X 1 ,X 2 )^ ^ 工 ) 

carries evidently the p points into finite points simultaneously. 

4. Equivalence of triples of points 

Definition 1 A subspace is said to be normal if it is equivalent to the subspace 
formed by symmetric matrices (in nonhomogeneous coordinates) of the form 

( 4 r) o 1 

V o o ("- r > )' 

The least possible r is defined to be the rank of the subspace. 

Definition 2 A triple of points is said to be of degeneracy d = ra—rif it belongs 
to a normal subspace of rank r. 

Evidently degeneracy is invariant under symplectic transformations. 

Theorem 3 In the complex field, two triples of points are equivalent if and 
only if they have the same degeneracy and the arithmetic distances between any two 
corresponding pairs of points are equal. 

Proof Evidently, if two triples axe equivalent, they have the same degeneracy 
and the arithmetic distances between any two corresponding pairs of points are equal. 
We prove the converse in six steps. 

(1) Every triple with arithmetic distances n, n, r is equivalent to 


0 , /, 


-_ r ( r ) o 


(in nonhomogeneous coordinates) 



distance between A and B. Let A, B, C be the three points of the triple. Then 
r(A,B) = r(A,C) — n. 

By Theorem 2, we may write in homogeneous coordinates 
A = {1,0), B = (0,7), C = {ZuZi). 

Since r(A, C) =n and Z 2 is nonsingular, we may write C as 

(5,/), 

where 5 is a symmetric matrix of rank r. We have a nonsingular matrix T such that 


(1,0) ' 

( r 0 \ 

( rr (/, o ) 

(0,/) 

( or - 1 ): 

(o./) 

{S,I) 


V (Jr, I) 


Thus the triple is equivalent to 

(/,0), (0,/), (Jr,/). 

Since (in the nonhomogeneous coordinate system) 

0 , /, ~Ir 

is a triple with distances n, n, r, we have the theorem. 

(2) Every triple of points with arithmetic distances n, s, t is equivalent to 

0 0 \ 

0 0 0， 

0 0 1 ^ ) 

where p + q = s,n-q = t (obviously, s + t^n). 

In fact, we may assume that 


-4 = (7,0), B = (/,/), C = (Z U Z 2 ). 
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We may determine two nonsingular matrices U, V such that 

/ J( r ) 0 、 

哪 o o )， 

where r is the rank of Z^. If we set 
G = 

the relations 


v '- 1 0 、 

V - v'- 1 V , 


U(Z l ,Z a )G 

imply that we may assume that 

= P, Zi 


U(I,0)G = UV , ~ 1 (I,0), 
U(I,I)G = UV(I,I), 

/( r > 0 、 


(Ko 


0 , 


» o' 
. 0 0 , 


Owing to the symmetry, we have 

o 

where S is symmetric and T is nonsingular. Further, since 

- W r T r ){{ S ： ° o )) = (( S r ：)•{'： ° o )) 


we may assume that 


Zx 


S( r > 

0 




0 

0 


In the normal subspace of rank r，the points (J( r > ， 0( r )) ， (J( r > ， /( r )) ， (5( r )，/( r )) are, by 
(1)，equivalent to 


(/ (r) ,0W), (/W,/M), 



0 

0( r _P) 


)) 


Thus, we have, in nonhomogeneous coordinates, 



The transformation 


( /(«•) 0 \( z _( 0 (r) 0 W( / (r) 

^ o 八一 （ 0 J ( f ) J 八。 


0 


W 


carries the three points to the required form. 

(3) Now we are going to prove that any three points are equivalent to 


■4 = 0 ， B = J»i + • • • +6a, C = ci + ■■■ + ①， 

where b u and c v are unit matrices of degree (y), multiplied with a factor 1 ，0 , or -1. 
(1) and (2) are special cases of this. We shall consider another special case with 


A = 0, B 


0 

M' 




N 

0 


where 



They form a triple with distances 2m, 2m, 2m. 

Now we are going to establish that there exists a symmetric matrix S such that 


the transformation 

W = Z(SZ +1)- 1 


will carry the three points to 


■4 = 0 ， B 


0 





where Bi is nonsingular. In fact 5 is given by 


0 


0 

0 

0 



1 0 0 

0 0 0 

0 0 0 

-10 0 
0 0 0 


0 

0 

1 

0 



and so on. The general form may be obtained easily. Applying the results obtained 
in (2) to 

0("-”， S 卜”， c { n_1> ， 

①+ and denote direct sums. 
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and 

rCT’ = Cj -}- • • ■ + ca> 

where 

(61/, c v ) 

is either the pair discussed in (4) or the pair discussed in (3), hence the results in (3). 

(6) By a rearrangement and some evident modifications, 
with degeneracy t, we have 


Geometries of matrices. I. Generalizations of von Staudt’s theorem 
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A = (^) + 0 ⑷ + 0( r ) + 0(*) + 0 ⑴， 

+ 0 (,) + 1^ + /(*> -i- 0 (t) , 
c= - 1 ^ + -i- o< r > -i- /(*> -i- o (t) , 


which is the only possible form. The arithmetic distances between two points are 
given by 

o= r(B, C) = p + q + r, 
b= r(C, A) = p + q-\- 8, 
c = r(A, B) = p + r + 8. 

Thus, for given t, a, b, c, if the equations are soluble, the solution is unique. We have 
therefore the theorem. 

The conditions for solubility are 


n 一 t b t c t 

a + b + c ^2(n — t). (1) 

In terms of a “triangle” we have the following theorem. 

Theorem 4 A triangle of degeneracy t with sides a, b, c exists if and only if 
(1) holds. If it exists, it is unique apart from equivalence. 

Incidentally, we have 


a + b > 2(n — <) — c ^ c, 

equality holds if and only ii c = a + b = n — t. 

The “triangle^relation” 

o + 6 > c, 6 + c > o, c + a^b 


does not guarantee the existence of triangles with a given degeneracy, for example, 
n = 2, t = 0, a = b = c = l. But we have the following theorem. 

Theorem 5 Given the lengths of three sides a, b, c « n), where the sum of 
every two is greater than the third one, there are A non-equivalent triangles, where 

^ _ f [(o + 6 + c)/2] - max(a, b, c) + 1, for n > [(a + 6 + c)/2] ①， 

^ n — max(a, 6, c) + 1, for n < [(a + 6 + c)/2]. 

Proof From a + b^c, b + c^ a, c+a^b, v/e have 
① [x] denotes the integral part of x. 
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a + b + c^ 2 max(a, b, c). 

There always exists a < such that 

a + b + c^ 2(n — t)^ 2 max(a, b, c). 

Then 

max(0, n — [(a + 6 + c)/2]) ^ t ^ n — max(a, b, c). 

Thus, the number of t's is equal to 

n — max(a, b, c) — max(0, n — [(a + 6 4 - c)/2]) + 1 
=min(n, [(a + 6 + c)/2]) — max(a, b, c) + 1. 

Corollary 1 If one of the sides is of length n, the triangle is unique. 
Corollary 2 If the sum of two sides is equal to the third, then the triangle is 
unique. 

5. Equivalence of quadruples of points 

Definition Let Z\,Z 2 , Z 3 , Z\ be four points in the nonhomogeneouscoordinate- 
system. The matrix 

(Zx- Z 3 ){Z X - Z 4 )(Z 2 - Z z )~ l 

is defined to be the cross-ratio-matrix of the four points, and it is denoted by 
(Zi, Zi\ Z 3 , Z 4 ). 

It is defined only when Z\ - Z\ and Z 2 - Z 3 are nonsingular. 

In the homogeneous coordinate-system, we let Pi, P 2 , P 3 , P 4 be four points with 
coordinates 

(Xi.ro, ( x 2 , y 2 ), (x 3 ， r 3 ), (x 4 ,n). 

In terras of 

(p it Pj) = (x^diXuYiY, 
the cross-ratio-matrix is defined by 

(Pl,P2 ； P^,PA) = (Pl,P3)(Pl,P4)- l (P2,P4)(Fil,P3)- 1 , 
provided that it is not meaningless. 


Let be the point with coordinates 

(w) = QiiX^Yi)% 

where I is symplectic; then 

(P ：， P；) ={X ； ,Y ； )^(X ； ,Y；y 

=Q j {X j ,Y j )^(X i ,Y i YQi = Qi{Pi,Pj)Qi- 

Therefore 

(p ： ,p ； -,p ； ,p；) =(i>r, p^)(Pi, 

=Q 3 (Pi,P3)Q'iQ ， ： 1 {Pi,P4)Q4Q ： 1 (P^P4)Q ， 2Q ， 2 1 (P^ Pa^Qa 1 
=Q 3 (Pi,P2 ； P 3 ,Pa)Q3\ 
and we now state the following theorem. 

Theorem 6 In an algebraically closed field, two quadruples of points, no two 
of the points having arithmetic distance less than n, are equivalent if and only if their 
croas-ratio-matrices are equivalent. 

In order to prove Theorem 6, we need to establish the following theorem. 
Theorem 7 In the algebraically closed field, any quadruple of points, no two 
of which have arithmetic distance less than n, is equivalent to 

0. oo. ^ 2 ' a*. S ' bit 

where 


Oi = 

' 0 … o i N 
0 … 1 0 

， bi = 

^ 0 0 •• 

0 0 - 

, 0 入 i 、 

入 i 1 


j ... o oj 


、入 < 1 • 

•00) 


Proof In homogeneous coordinates, we may write the four points as 

(o,/), (/,o), (z lt z 2 ), m,w 2 ). 

Since no two of the arithmetic distances are less than m, Zi, Z 2 , W\ , W 2 are all non¬ 
singular. We may write them in the nonhomogeneous coordinates as 
0, 00 , 5i, S 2 . 

We have a nonsingular matrix T such that 

TSir^J2 ，ai ' TS 2 r^^2'bi. 
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The theorem follows. 

The proof of Theorem 6 is now evident. 

Remark The equivalence of quadruples in any field seems to be more difficult. 
The condition in Theorem 6 is insufficient for the real case (a signature system is 
required). 

Definition We define a quadruple of points satisfying 

(Pl,P2；P 3 ,P4) = ~I 

to be a harmonic range. 

Evidently harmonic range is invariant under a symplectic transformation. 

6. Von Staudt’s theorem in the complex number field 
Now we let 龟 be the field formed by complex numbers. 

We use Z to denote the conjugate complex matrix of Z. The transformation 

(W U W 2 ) = Q(Z u Z 2 )T 

carrying a symmetric pair (H^j, W 2 ) into a symmetric pair (Zi,Z^) is called anti- 
aymplectic if Q is nonsingular and I symplectic. 

Theorem 8 A transformation satisfying the following conditions: 

(1) one-to-one and continuous; 

(2) carrying symmetric matrices into symmetric matrices', 

(3) keeping arithmetic distance invariant, 

(4) keeping the harmonic relation invariant 

is either a symplectic or an anti-symplectic transformation. 

Proof Let T be the transformation considered. Taking three points A, B,C 
(symmetric matrices), no two of which have arithmetic distance less than n, let 
Ai, Bi, Ci be their images. By (3), the arithmetic distance between any two of 
Ai, Bi, Ci is n. Let Ti and be two symplectic transformations carrying respec¬ 
tively A, B, C and A\, Bi, Ci into 0, /, c», in accordance with Theorem 3. Then, 
without loss of generality, we may assume that 

0 = r(o), i = r(/), cx) = r(oo). 

Since 

Z, Z\, {Z + Zi)/2, 00 
form a harmonic range, we have 


r(z) + r(z 1 ) = r(z+z 1 ). 



Consequently, 


T[tZ) = rT(Z) 


for all rational r. By continuity, this holds for all real r 
Now we introduce the following notations: 


Eu = (p a t), Pst = 


1, if s = t = i, 
0, otherwise 


Eij = (4«t), q»t = 


1, if s = i, t = j’ or s = _?•，f 
0’ otherwise. 


Let 


r(Eu) = Mi. 

Since is of rank 1 and symmetric, we have 

Mi = (A<i, • • • , Aj n )’(Aii ， … ， Ai n ). 


Let 

Then 


A =(〜). 


J = r(/)=^r(£；“) = ^A^ 

i=l i=l 

n 

= y^(Ati, ■• - ,Ain)’(A<l， ••- > Ain) 

i=l 

== I e XijXik I 

i=l \i=l / 


That is, A is an orthogonal matrix 

(Aji ，…， AjnJA 7 = • • • i Sin), 

where 6ij is Kronecker's delta. Thus 

AT^i)^ = Eu. 


Let 


A(Z) = \T{Z)A.', 

then A has the same property as V, that is, 
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and 


Let 


A(Z + Z!) = A(Z) + A(Zi), 

0 = A ⑼， J = A(/), oo = A(oo) 

Eu — △(£■“)• 

A(£ij) = M = (m 5t ), * ^ j, 


M is of rank 2. Since 

Eij + XEu + Ejj/X 

is of rank 1, owing to the invariance of arithmetic distance, the matrix 


M + XEa + Ejj/X 


(1) 


is also of rank 1 for all A. We are going to prove that M = 土岛 j. In fact, we may 
assume that * = 1, j = 2. The two-rowed minor of (1) 

TO 11 + A Wil2 
mi2 饥 22 + 1 /A 



for any A, that Ls 


m\\m2i — m.i2 + mu/X + m. 22 ^ + 1 = 0, 


that is 


mu = ni22 = 0, m\2 = 士 1. 

Further 

mu + A 
mi* 

for all A, then mtt = 0, mu = 0 for all i ^ 3. Finally 

= 0 if (M)#(l ， 2), 


0 m at 

mat 0 


= 0, t^3, 


then m st = 0 for (s,t) ^ (1,2). Hence, we have 


M = 


Thus 


A(Eij) = 士 Eij. 




Geometries of matrices. I. Generalizations of von Staudt’s theorem 
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Let 


Then 


D = [&!,-•• ,£ n ], 6 P = ± 1 . 

DA(Eij)D > = ±£iEjEij. 


Thus we may choose e properly so that 


DAiEu)^ = E u . 


Let DAD' = II. Then II has all properties of A and further 


n(^ii) = Eu- 

Now we consider Without loss of generality we take (i, j) = (2, 3). Then, if 
n(f? 23 ) = -f? 23 . we have 

n(£n 4 - E33 + Ei2 + Ei3 + 五 23) = 忍 11 + 说 3 + 五 12 + 丑 13 _ 芯 23 ， 


Bince 

1 1 1 

1 0 e =-(e-l ) 2 ， 

1 e 1 

which is equal to zero for e = 1 and not zero for e = -1. Consequently the ranks of 
Eu + e 33 + E u + Ei 3 + E 23 and E n + E 33 + E 12 + E 13 - thz are not equal. This is 
impossible. 

Thus, we have 

n(X) = X 

for all real X (if we do not use continuity, it holds for all rational X). We may assume 
r to be n. 

Further for real Y, the four points Yi, -Yi, Y, -Y forma haimonic range, while 
T{Y) = Y, r(-y) = -r, 

thus we have 

= -yr(iy)- 1 . 


In particular, 


(r(*/)) 2 = 
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Then 

r(t/) = ij, 

where J is an involutory symmetric matrix, that is J 2 = J and J = J'. We have a 
matrix T (not necessarily orthogonal) such that 


Let 


we have then 


= 9(Z), 


4>(il) = */. 

Let 

V(Z) = 

Then 

▽(0) = 0, V(/) = I, ▽( 00 ) = 00 ， 

so the ranks of Z and V(Z) are equal. By the method used before, we have 


Vi = B'VB, 


such that 

Vi(X) = X, 

for all real X. Thus, we have finally that 

r(x + iY) = r{X) + r(»y) = x+ iMya, 

where A is independent of X and Y. 

Now we have 

A'YAY- 1 = Y{A'Y A)- 1 , 

that is, 

(A'YAY- 1 ) 2 = I, 

for all real Y. Here we introduce a lemma. 

Lemma Let A be a nonsingular matrix. If 

{A'YAY- 1 ) 2 = I 

for all symmetric Y then A = pi, where p = ±1 or 土 i. 




If the lemma is true, then 


r(X+iK) = X + iY oi X -iY. 

The theorem is proved. 

Proof of the lemma (1) We have a nonsingular matrix T such that 


and 

B = Ji + • • • + «7i/, 
where Jj is a Jordan matrix of degree n “ Evidently 

(B’(r'yr)B(riT )- 1 ) 2 = ^(a'yay - 1 ) 2 ^ 1 = i. 

Thus it is sufficient to prove the theorem for B instead of A. 

(2) We shall prove n< = 1. In fact, if 

/ A 1 … O' 

0 入 … 0 

Ji=.. . 

* 0 0 … A 


(JiUil ) 2 = 


fX 2 X 0 … 0 、 /A 2 (A 2 + 1) 2A 3 + A A 2 

A 1 + 入 2 入 … 0 


、 0 0 0 … 1 + A 2 / 

which is impossible. Thus = 1, that is, 

B = ,e n ]» G 勞 0 ， 

which is a diagonal matrix. 

(3) Putting 
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we have 

( 卜 1 0 ^ A A Ai 0 )( 0 制 2£〒£ 2 ( £1 - £2 )、= (1 0’ 

\\ 0 勿八 1 0 八 0 e 2 )\l -lJJ 0 s\e\ 厂 、 0 1 , 

which implies 

ej = £2• 



B = el, c^O. 

Then A = el. For Y = I, y/e have e 4 = 1, that is e = ±1, ±i. The lemma is thus 
completely proved. 

7. Remarks 

The following results are contained in the proof of Theorem 8: 

Theorem 9 Let $ be the complex field, and let fDt be a module formed by 
symmetric matrices over Let T be an additive continuous automorphism of DJI 
leaving the rank invariant, so that T satisfies ： 

(i) r(X) €®l, 

(ii) T(X + Y) = r(X) + r(y), i/XemandY^m ； 

(iii) r(iX) = ir(X); and 

(iv) r(X) has the same rank as X. 

Then r(^T) is an inner automorphism, that is 

r(X) = TXT' 

for certain T. 

In the case of the real field the situation is more complicated. In Theorem 9, we 
require an additional condition that the signature of T(X) is the same as that of X. 

The analogue of Theorem 8 in the real field is more complicated. Since Theorem 7 
is not true in case of the real field, degeneracy and lengths of sides do not characterize 
the equivalence of triples of points, for example, there does not exist a real symplectic 
transformation F satisfying 

、 ，、 / 2 0 \ 广 -20、 

r ( 0 ) = 0 ’ r ( oc ) = oo, r( o _ 2 ) = ( 0 _ 2 j- 

In fact the transformation satisfying the first two of these relations is of the form 
T(Z) = CZC\ 



signature of a triple, we may obtain an analogue of Theorem 9 in the real field. 

II. Geometry of skew-symmetric matrices 

Throughout II, we use 



We let n = 2m. 

8. Notions 

A pair of matrices (Z\, Z-i) is said to be skew-symmetric if 

(Zu z 2 )^ l (z l , z 2 y = o, 

that is, 

ZiZ f 2 = -Z 2 Z[. 

A 2n x 2n matrix 1 is said to be ffi-orthogonal, if 

WT'=5i. 

We define 

{,Wu W 2 ) = Q(Zt, Z 2 )I 

to be an 5i-orthogonal transformation if Q is nonsingular. 

The transformation carries nonsingular skew-symmetric pairs into nonsingular 
skew-symmetric pairs. 
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Two point-pairs are equivalent if and only if they have the same arithmetic 
distance. 

We may also define the cross-ratio-matrix of four points Pi, Pi, P 3 , P 4 , Pi — 
(Xi,Yi)(i = l,2,3,4)by 

(Pi, P 3 )(Pu P4)~ 1 (P2, P 4 XP 2 , P 3 )-\ 

where 

(P u Pj) = (Xj, YiY， 

The analogue of Theorem 6 is also true. 

If the cross-ratio-matrix is equal to —I, we define Pi, P 2 , P 3 , Pa to be a harmonic 
range. 

9. An algebraic theorem 

On the ground of similarity, the following statement seems to be true. 

Let r be a continuous (additive) automorphism of the module formed by all skew- 
symmetric matrices, such that r(iX) = ir(X), and that the rank is left invariant. 
Then r(X) is an inner automorphism. 

Unfortunately, this statement is false and so the situation becomes more compli¬ 
cated. For n = 2, 


r 0 

a 

b c 、 


f 0 

a 

b 

d 、 

—a 

0 

d e 


—a 

0 

c 

e 

-b 

-d 

0 f 


-b 

— c 

0 

/ 

\ ~ c 

—e 

~f 0 ) 


l ~d 

—e 

-/ 

o / 


is an automorphism but not an inner automorphism. 

It is an automorphism of the required kind, since the principal minors form equal 
sets, say 

{af -be + cd) 2 ; a 2 , b 2 , c 2 , d 2 , e 2 , / 2 . 

It is not an inner automorphism. In fact, we write 

dHV), 

where P and R are two-rowed skew-symmetric matrices. Suppose it is an inner 
automorphism, that is, that there exists a nonsingular matrix 



such that 


( A 

( p 


(A B\' _ 

( p Q'\ 

\C d) 

\~Q' 


[c d)- 

\-Q R) 


for all P, Q, R. 

In particular, if P = /i = 0, Q = /, we have 


(a 



( a b ) 


' 0 M 

\c 

d) 

L _/ 0 J 

U 

1 



Combining (1) and (2), we have 

( A B\(Q -P\^( Q' ~ P \( A B ^ 
D )\R Q 1 )~ \ R Q )\C D) 

Putting P = R = 0, we have 

AQ = Q'A, BQ' = Q'B, CQ = QC, DQ' = QD 

for any Q. Consequently, we obtain 

B = 01, C = 7 /, A = D = 0. 


⑴ 


(2) 


But 



(p q) 

( 0 ^ V 

.1 ^ 2r 

-PiQ' \ 

\ 7 / 0 ； 

V-^ R) 

U 7 ° J 

V HiQ 

i 2 p ) 


which, in general, is not equal to 

( P Q， ). 

\-Q R ) 

Thus the automorphism is not an inner automorphism. 

The above argument suggests that in general we might have m — 1 basic auto¬ 
morphisms: 

(i) a \4 ^ 023 , other elements invariant; 

(ii) ai 4 ^ a 2 3 , a 16 口 a 2 s, other elements invariant; 

( iii ) au ^ a23 , ai 6 口 025 , ai 8 ^ « 27 , other elements invariant ; 


(m - 1) au ^ a 2 3 , …， ai, 2 m ^ a ， 2 , 2 m-i, other elements invariant. 
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Such a reasonable suggestion is a false one, since for m > 3, ll ai4 ^ 023 " does 
not keep the rank invariant, for example, 


n 


□ 

n 

a 

■ 

n 

□ 


i 

Ml! 


19 

Q 


B 



p 

D 


■ 



n 





B 


n 


■ 




a 

i 


El 




■ 

■ 

d 

a 

1 

■ 

■ 

n 

B 


tsi 
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B 


■ 


B 

D 

圈 



Here one matrix is singular while the other is nonsingular. 

Theorem 10 Let $ be the field of complex numbers. Let W be the module 
formed by all skew-symmetric matrices over Let r be a continuous (additive) 
automorphism of ^ leaving the mnk invariant and r(iX) = ir(X). Then, for m ^ 2, 
r is an inner automorphism. For m = 2 there exists a nonsingular matrix T such 
that 

r(X) = TX V T', 


where X v is either X or 



Ql2 

0 

—Ol4 
—024 


«13 


014 

0 

-034 


Q23 

«24 


034 I 

0 / 


Proof (i) Evidently, the automorphisms 


Y = TX V T 

satisfy the requirement, where T is nonsingular, 
(ii) T^e additive property may be stated as 


r(jf + y) = r(^)+r(y), ⑴ 

for any two X and Y belonging to 9JI. Putting AT = y = 0, we have 

r ⑼ = 0 . ⑵ 


It is also very easy to deduce that 


r(rX) = rT(X) 


⑶ 












= QT(X)Q', then is an automorphism satisfying the properties give 
em and 

ri(J) = J, (< 



ead of Tj (in the following we shall repeat this procedure by the simp: 
we may let r satisfy ⑷”）. 




















Consider the two pairs of matrices A, J and l J. oince i(A — aj) = l (/i.) — aj, 
the characteristic roots of r(A — \J) are also Ai, ••- , A m (each is a double root). We 
have a nonsingular matrix M such that 

MT{A)M' = A, MJM' = J. (5) 

Now we are going to prove that M can be chosen independent of the A*s. Write 
M = We have 
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where with Aj = 1 and Aj = 0 for j ^ i. In this expression, only the 

(2i — l)th and 2ith columns are significant. Let P be a matrix having (2i — l)th and 
2ith columns in common with Mi for i = 1,2, … ,n. Then 


Putting Ai = = Am = 

Now we may let 


r ⑷ = PAP 1 . 
e find that P is nonsingular. 

r(A) = a, 


( 0 

Ai U.. 

0 

Am ) 

V -^1 

0 ) 

V -^n 

0 ) 


(iv) The theorem is evident for m = 
Now we take m = 2. Let 


( 0 

a 

b 

c 、 


( 0 

a 1 

b' 


—a 

0 

d 

e 


—a' 

0 

d' 

e' 

-b 

—d 

0 

/ 


-b' 

-d' 

0 

f 

~ c 

—e 

-/ 

o ) 



-e' 

-r 

o / 


0 

a 

-A 


b 

c 




-a + 

A 

0 


d 

e 

= 0, 



-b 


_d 


0 f 

-M 



—c 


—e 

-f + f* 

0 i 



’_M) 

— be + dc = 

0, if and only if 




1 

0 

a' 

-A 


1/ 

d 

1 



-a'+ \ 

-V 


0 d! e' 

-d! 0 f-fi 

—e’ -f +n 0 


a = f = f, 

be — cd = b'e' — dd!. 


⑺ 

⑻ 


Now we consider 




where stands for zero-matrix a 

or has no essential significance in 

for the rest of the paper). Then A 
than or equal to 1, that is 


has two characteristic roots 0 and 1. 
1, such that 

PMQ 

Therefore 



(: 



and 



Since 



We deduce easily that 


Prom (8) we have 
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In particular, we have 



Since 

/ ( d，)" 2 0 0 0 

o (<o _1/2 o o 

0 0 (rf)- 1/2 0 

、 0 0 0 ⑷ 1/2 
f (d ; ) 1/2 0 0 0 

0 ⑷ _1/2 o 0 

x o o («o _1/2 o 
、 0 0 0 ⑷" 2 
we mav let 




for = 1 and 0, so we have either 

T(X) = X 
or 

F{X) = Xr. 

(v) For the sake of simplicity, we give the proof for m = 3. The method is valid 
for any m. 





Geometries of matrices. I. GeneraIi 2 ations of \ 



the determinants of 






Since M\ is of rank 2, we have |P| = 0. Since 


0 

- I 

I - Mi 

is of rank 2, we have |P — /| = 0. There is also a matrix Q such that 


(10) 



Thus we may assume that 


/ i n \ 1 

< 1 0 a ’i5 、 

■ ； o * 1 


0 0 O25 026 

* ■ • I 

= 

a 35 a 36 



a 45 a 46 

\ / 

1 * * J 


By consideration of the ranks of the previous matrix and the matrix given in (10) we 
find 

°15 = a， 16 = a 25 = a 26 = a 35 = a 36 ^ a 45 = a 46 = 0- 
Consider again a general skew-symmetric matrix M and its image Afj = r(M). 
By the same method used for m = 2, we have either 

°16 a， 16 ^ 
a 25 °26 

a 35 a 36 

<»45 °46 

0 <*56 

0 J 

°15 °16 ^ 

°25 °26 

«35 a 36 

°45 a 46 

0 ^56 

0 / 


Mi = 


0 Oi2 <*13 Ol4 
0 023 a 24 


I o a 12 ai3 o 2 3 
0 014 024 

0 a 34 

0 
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/?) r(M) = a / 2 


Repeating the process for 


we obtain either 


<*15 Oi 6 
<*25 <*26 . 


。25 口 i tt25 ， Ol6 口 Ol6i 


»25 ^ Ol6> Ql6 °25- 

For the equivalence of “叫 口 023 " and 口 fl25 n , we have three cases: (a) r(JW)= 
Mi, Mi is obtained by replacing a 3s , 03 c, a 45 , a 46 by aL, a' M , a\ b , a'^ in M, 


0 ai2 Ol3 023 Ol5 a 25 

014 024 ai6 026 

0 a 34 a 35 °36 

0 a 45 °46 

0 a 56 

0 / 

(a) We leave 035 , 03 s, 045 > a 46 arbitrary. Putting ai 6 = 024 = <*34 = 1 ， 025 = x, 
and the others equal to 0, we have det(M) = (x — 03 s) 2 , det(Afi) = (x — a' 35 ) 2 . 
We have det(M) = 0 if and only if det(Afi) = 0, that is 035 = a' 35 . Next putting 
Oi 6 = 023 = 034 = 1 , 025 = x i and the others equal to 0, we obtain 045 = 045 . 
Putting ai 5 = a24 = ai 4 = 1, 026 — x, and the others equal to 0 and putting 
ai 5 = «23 = 034 - 1 , «26 = x and the others equal to 0, we have respectively 

°36 = a 36; °46 = °46- 


( 7 ) 


r(M) = M3 = 


\ / 

1 2/3/4 50 
a a a a a 

5 5 5 5 

ala2ak,a4o 
10240340 



Thus we have 


r(M) = M. 
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(0) and ( 7 ) Putting 024 = <134 = 1. flie = ^25 = x and others equal to 0, we have 
035 = a' 35 . Further, if we put 

<*i2 = ai3 = 0 , au =x, ais = 1 ， ai 6 = 0 , 023 = y, <124 = 025 = 0 , 026 = 1 , 

Q34 =0, 035 = -1, 036 — 045 = <*4 6 = 0 ， O56 = 1 , 

then 

d(M) = (x(y - l)) 2 , d(M 1 ) = (y(x- l)) 2 . 

By putting x = —1, y = +1, we see that this is impossible. 

The general proof may be arranged in the following steps: 

(a) Dividing the matrix into m 2 2-rowed matrices; 

(b) Choosing the first row of the small matrices as in the case m = 3 and applying 
the analogous method as above to the image; 

(c) Determining the other small matrices by the method given for m = 3 (from 
( 10 ) et seq.); 

(d) Considering the 6 -rowed minors we find that the exceptional case appearing 
for m = 2 cannot exist for rra > 3. 

10. Another generalization of von Staudt’s theorem 
The transformation 

(W U W 2 ) = Q(Z U Z 2 )Z 

is called anti-orthogonal if Q is nonsingular and X is 3i-orthogonal. 

Lemma Let A be a nonsingular matrix. Suppose that 

(A^AY- 1 ) 2 = I 

holds for all skew-symmetric Y. For m = 1, A is a matrix of determinant ±1. For 
m > 1, then 

A = pl, or A = pT[l, - - 
where p — ± 1 , ±i, and conversely. 

Proof (i) The result is evident for m = 1. 

For m > 1,A = pi evidently satisfies the equation. Now we prove that A = 
pT[\, - • • , 1, — 1 ]T -1 satisfies the equation. 

We write 

o)' (rrr)_1= (5 o)- 
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Then 

(0 / 广 （ -t/ 0 八 -〆 o J _ L -t/yi* -v'v* ) 

Further 




Since v*'v = —1，we have 


+ 2vv*') 2 = 1 + Aw*' + 4w* W’ = 

hence the result. 

(ii) As in the proof of the lemma of §6, we may assume that 

A = J\ + J2 + • • • » 

where J„ is again of degree n„. The number of odd n^'s is always even. 

(iii) We consider the case 

A = J("), 

where n is even. Write 

p n q ), P = P {n/2 \ Q = q [n/2 \ 

Op ； 
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implies (j/p ) 2 = J( n / 2 〉，which is impossible for n > 2 (that is m > 1 ). 
(iv) Let 

A = J1 + J2, «i + «2 = n, 
where tii and are both odd. Let nj = 712. Write 

-(：：)■ -(： T )- 

Then, we have 


{A'YAY - 1 ) 2 = 


(p" 9) 2 * 


which is possible only for m =n2 = !• Further let rii > «2 - We write 


P Q 


,Y = 


0 

-I 0 


(^YAY- 1 ) 2 = 


(pV) 3 


which is impossible for «i > 3 . For ni = 3 and ri2 = 1 , we have consequently 

'A 1 0 0 N 

0 A 1 0 

0 0 A 0 

, 0 0 0 /X y 

Taking 


,n = —A, A 4 = 1. 


we find this also to be impossible. 
Thus each of the numbers 


must be either 1 or 2 . 

(v) It is easily seen that no two of the n u 's can be 2 . In fact 
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say. Taking 



we find this to be impossible. 

(vi) Further, if one of the n v 's is 2, then n is equal to 2. In fact, suppose that 


A 


Ai 1 
0 Ai 


入 2 0 

0 A 3 


Taking 


we have 




A 2 = - 入 3 , = 1. 


Taking further 

' 0 0 1 1 \ 

y= 0010 
-1-10 0 ' 
、一 1 0 0 0 J 

we have X 2 = A 3 . Both results cannot hold simultaneously, 
(vii) Suppose n ^ 4. Let 


A = [Ai, A 2 , A 3 , A 4 ]. 


Taking 


0 0 11 

0 0 10 

- 1-100 
-10 0 0 


(^YAY- 1 ) 2 


( A?A| Aj(AiA4 + A2A3)(A3—A 4 ) 


A 2 A 3 A3(AiA4 + A2A3)(Ai—A 2 ) 

A?A1 ) 


⑴ 
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which implies 


A?A= = Xl\l = 1. 


Since Ai, A 2 , A 3 , 入 4 can be permuted, we have 

= 1 for all i,j (* ^ j). 


Thus 

Aj = 1, Af = A2 = A3 = = ±1. 

In general, A = [Ai, • • - , An], where = • ■.= 入三 =±1. By choosing a suitable 
p in the lemma, we may consider the case with 


A] = A 2 = = 1. 


If among the A’s there occur two positive and two negative numbers, we take Ai = 
A 4 = 1, A 2 = A 3 = -1. Then (1) is impossible. Thus we have the lemma. 

Theorem 11 A transformation satisfying the following conditions: 

( 1 ) one-to-one and continuous-, 

(2) carrying skew-symmetric matrices into skew-symmetric matrices; 

( 3 ) keeping arithmetic distance invariant, 

and 

(4) keeping the harmonic relation invariant 

is for n / 4 either ^i-orthogonal or anti-orthogonal. In the case n = 4, the transfor¬ 
mation is either ^i-orthogonal or anti-orthogonal, or is equivalent to 

r(Z) = Z lt where Z = ( _ P 〆 :) ， Z! 

or equivalent to 

T{Z) = Zx. 



Proof (i) A triple of points, no two of which have arithmetic distance less than 
n, is equivalent to 

0 / < m > 

-I( m ) 0 

We may let the transformation satisfy 




r(o) = 0, r($) = 5, r(oo) = oo. 
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(ii) As in the symmetric case, we have 

r(Z) + r(z 1 )=r(^ + ^ 1 ). 

Again, for n 5^ 4 , we have, analogous to the symmetric case, 

T{X + iY) = X+iA r YA. 

Since Y, —Y are separated harmonically by iY and —iY for real Y, we have 

r(yt)r _1 = 

that is, 

(A^AY - 1 ) 2 = I 

for all real Y. Now we suppose m > 3 ①， then we have 

r(x+<y) = x±iF 

or 

r(^ + iY) = A ■士 *T ,_1 [1, … ， 1, -l]rYT[l, … ， 1 ， -ljr -1 . 

The first case is what we require. Changing variables in the second case we may let 

r(x +iY)=x±i[i, ... ， 1 ， -i]r[i, 

Since 

0 0 1 1 I 0 0 11 

0 0 t 1=0 and 0 0 i —t ^ 0, 

* * 
the rank is not invariant. The last case does not satisfy our requirement. 

In case m = 2 , a great deal of special consideration is needed. Apart from the 
lemma, we require the solutions of 

(A'Y^AY- 1 ) 3 = /, 

where 

W 卜 un. 

① For m = 1, the result is almost evident. 



The proof of the lemma establishes that either 


A = pl 
or 

A = pT[l, 1, 1, -l]T-> 

(the Q used here is always symmetric). 

As in the preceding proof we have four cases, 

r(X + *y) = X ± iA'YA, or Xi± iA'Y x A\ 
or Xi 士 iAlM, or X±iA , Y l A. 

By the previous argument, for m ^ 3, we have, for the first two cases, 

r(Z) = Z, Z, Z u Z u 

where Z = X + iY, and Z\ is obtained from Z by the process yielding Y\ from Y. 
Next, if r(X + iY) = X! 土 iA'YA, we have either 

r(X + iY) = X l ±iY or ^±*[1,1,1, -l 】 y 【 l, 1，1， -1]. 

Putting 



we see that both these automorphisms could render a singular matrix nonsingular. 
Thus both these cases are ruled out. 

Finally, the possibility of 

r(x + iy) = x±i^y 1 A , 


may be treated in a similar way. 
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III. Geometry of Hermitian matrices 


The geometry of symmetric matrices in the real domain is closely analogous to 
the present geometry. 

11. Notions 

We define an Hermitian pair (Zi, Z2) of matrices by 

z 2 y =0. 

A conjunctively symplectic matrix X is defined by ① 

We define a conjunctively symplectic transformation by 
(Wu W 2 ) = Q(Z U Z 2 )1. 

We identify two nonsingular Hermitian pairs of matrices (Zi, Z2) and (IVj, W 2 ) 
by means of the relation 

{Zu Z 2 ) = Q(W U W 7 ). 

It is called a point of the space. Evidently, the space so defined is transitive. The 
rank of 

(w u W2)d(Zt, z 2 y 

is defined to be the arithmetic distance of the points (VVi, W2) and (Zi, Z2). Two 
purs of points are equivalent if and only if they have the same arithmetic distance. 

JLet Pi, P 2, P 3 be three points no two of which have arithmetic distance less than 
n. Then they axe equivalent to the three points 

0, oo, K(= [1, ••- , 1, -1, ••- , -1]). 

The signature of K is defined to be the signature of the range Pi, P2, P3 (the order 
of points is significant). 

Evidently the signature is invariant under the group. We also may say that two 
triples of points are in the same sense if they have the same signature. We may 
① 5T* denotes the conjugate complex matrix of X. 


prove that if two ranges are in the same sense, there is a conjunctively symplectic 
transformation carrying one into the other. 

As to the equivalence of quadruples of points, a great deal of difficulty arises from 
the fact that the existing treatments of the theory of Hermitian forms are incomplete. 
We shall give elsewhere a complete classification and then its application to the present 
problem will be immediate. 

12. A further generalization of von Staudt’s theorem 

Theorem 12 Let r de an additive continuous automorphism of the module 
formed by all Hermitian matrices keeping rank and signature invariant. Then T is 
either an inner automorphism or an anti-automorphism (Z —» PZP'). 

Proof (Cf. the results of § 6 ) We have r ⑼ = 0. Let 

T(7) = H 0 . 

Since Ho is positive definite, we may let 

r(/) = /. 

As in the proof of Theorem 8 (in the real field), we may let 
T(X) = X 


for all real symmetric X. 

Let Y be any real skew-symmetric matrix, and let 

r(tV) = H. 


if and only if 


det(X + *K) = 0 


det(X + if) = 0, 

by Hilbert's theorem on polynomial ideals, we have an integer p such that 
(det(X + iY)y = 0 (mod det(X + H)) 


and 

(det(X + H)) p = 0 (mod det(X + iK)), 

in the polynomial ring formed from the real field by adjunction of the elements of X. 
Let X = X' = (Xij). We write 


)=/i»n + Si, 




det(X + H) =hx n + <? 2 , 






hrr = krr. 

Since 

hrr ^h-a I _ hrr kra 
Krs ^as I I kra h ta 

we have 

IM 2 = \krs\ 2 - 

We may choose Oi, ■ ■ ■ , 0„ such that the matrix 

[exp(* 6 >i), - - - ,exp(td n )]i/[exp(-i^j), - - - ,exp(-itf„)] 
has real hn, / 113 , . • • , h\ n - We may let 

hu = ku 

be real and positive. Consider 

^11 hu hij kn ku k\j 

^lt — ^li ^ii ^ij 

石 lj hij hjj I k\j kij kjj 

We have 

hij + hij = kij + kij, 

then letting hij =a + 0i, kij = y + Si, we have 

a 2 + /3 2 = 7 2 + 8 2 , a = 7 . 

Thus 0 = ±S and we have 

hi, = or k^. 

Theorem 13 A transformation satisfying the following conditions: 

( 1 ) one-to-one and continuous; 

(2) carrying Hermitian matrices into Hermitian matrices-, 

(3) keeping arithmetic distance invariant-, 

(4) keeping sense of triples of points invariant, 

(5) keeping the harmonic relation invariant 

is either a conjunctively symplectic or a conjunctively anti-symplectic transformation. 

The proof is omitted because of the similarity to the real analogue of Theorem 
8 (cf. §7). 
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IV. Geometry of rectangular matrices 

13. Subgeometries of the geometry of unitary matrices 

The geometry studied in III may also be interpreted as the geometry of unitary 
matrices. Since the matrix 


is of signature 0, we may use 


instead of it. Then the pair of matrices (Z\, Z2), satisfying 

(H)(: ^(Z 1 ,z 2 y = 0, 

that is 

ZiZ[ - z 2 z' 2 = o, i-(z^ZzKz^ZiY = 0, 

is the homogeneous representation of a unitary matrix. We can generalize the idea a 
little further. Let 



The matrix F( m+n ) satisfying 

rff 2 r , = d 2 

is called conjunctive with signature (n, m). The pair (Z\ n \ Z^ n ’ m )) of matrices satis¬ 
fying 

(Zi, Z 2 )S 2 (Zi, Z 2 y = 0 

is called an (n, m)-unitary pair. 

Instead of going into the details of this geometry we shall be content to make 
the following remark. 

Let 



Then we have 


WiW{ = I. 



Wi is formed by m columns of a unitary matrix. Thus the geometry may be coi^idered 
as a subgeometry of the geometry of unitary matrices by identifying the elements with 
the same m columns as an element of the subgeoxnetxy. This may be described in 
short as “the process of projection” • 

14. Remarks 


(i) The condition “one-to-one” is redundant, since the invariance of arithmetic 
distance implies it. 

(ii) The continuity for the real case is also very probably redundant (cf. Sierpin- 
ski’8 contribution to the solution of the functional equation f(x + y) = f(x) + f(y))- 

，:::、 n r na；rc nt mofrioac (with the group given in 










Geometries of matrices. I” Arithmetical 
construction* 


A discussion of the redundancy of the conditions involved in the generalizations 
of von Staudt’s theorem should be preceded by a study of the involutions. As an 
illustration and a supplement to part I, we give here a discussion of the geometry of 
2-rowed symmetric matrices; we intend to treat the general case at a later occasion. 


the invariance of arithmetic distance. For the real case, even continuity (as well as the 
condition concerning the signature) is redundant (a proof for even order symmetric 
matrices has been obtained). As to the general discussion, some knowledge concerning 
involutions seems to be indispensable; the author will come back to it later. 

Throughout the paper, the notations in I are taken over and we assume that 
n = 2. 

1. Normal subspaces 

Theorem 1 Given two matrices Z\ and Zi with arithmetic distance r(Z\, Z^) 
=1, the points Z satisfying 

r(Z, Z{) ^ 1, r(Z, Z 2 ) ^ 1 


form a normal subspace. 

Proof Without loss of generality, we may take 




Then we have 

(a ： ± l)z -y 2 = 0, 


* Presented to the Society, April 28, 1945; received 如 the editors January 2, 1945. Reprinted 







56- 


华罗庚文集代数卷 II 


of the first subspace is at distance 1 from X. This contradicts our hypothesis. Thus 
the second is spanned by 

(K), (: A) 

The theorem follows. 

Consequently two complementary subspaces have a unique matrix in common. 
Theorem 3 Let & denote the set of matrices S such that 

r(S, P) = 2, 

where P is the common matrix of two complementary normal subspaces. Then 6 is 
transitive under the subgroup leaving the two subspaces invariant. 

Proof Let the two complementary subspaces be 



and S be the points in &. By the hypothesis with P = 0, we know that 5 is nonsin¬ 
gular. The transformation 

Zi = z{-s~ l z + iy 1 

carries S into oo, and it leaves the subspaces 



invariant. 

2. Direct sum of complementary subspaces 

Definition The subspace formed by points of the form 

(o：) 

is called a (completely) reducible subspace which is the direct sum of the subspaces 



Now we shall give its arithmetic construction. We take, by Theorem 2, / as any point 
of the set ©. Find P satisfying 






Then fta = 0, Ai = (1 + p)y(l ~ I )> ^3 = (1 + 1/P)®(1 — y). 

The matrices of the form 

1 + (1 +p)y(l -x) 0 

0 1 + (1 + l/p)x(l - y) 

run over all matrices of the reducible space. 

Since we use the arithmetical notion only, we have the following general defini¬ 
tion. 

Definition Given two complementary subspaces X and 2), let Q be a point 
of the set 6. The reducible space (or direct sum of both subspaces) is defined by the 
aggregate of points K such that 

r(P + ,K) = r(P-,K) = l t 





- 58 - 


华罗庚文集代数卷 II 


where P + and P- are both solutions of 

r(Q, P) = r(P, X) = r(P, Y) = 1 

and X and Y belong to X and 2) respectively. 

As a consequence of Theorems 2 and 3 we have the following theorem. 
Theorem 4 The aggregate of reducible spaces is transitive. 

3. Involutions 
Given 

z=( Zl Z2 

\ Z2 23 

we wish to find all matrices 

x 0 \ 

0 y ) 

of a reducible space such that 



consequently 

(x - zi)(y - z 3 ) - = 0. 
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4. Commutative involutions 













-60- 


华罗庚文集代数卷 II 


If 5 carries ©o into itself, but not pointwise, then 

SoS= SSo- 

In fact, leaves So fixed pointwise. Thus 

5 _1 5o5i = 士 So, 


the upper sign is ruled out, since ff does not leave So pointwise fixed. 

Theorem 5 Any pair of commutative involutions of the first kind may be 
carried into ffi and S 2 simultaneously. 

Proof 
(4). Then 


P 

v k 2 P' 

and we have 
P = 

where 


The first one may be assumed to be 5i- 

Let the second be given by 


(1 

0 

0 

0 ) 


f 1 

0 

0 

0 \ 





0 

-1 

0 

0 


0 

-1 

0 

0 

( 

P 

-Kx \ 

- j 

0 

0 

1 

0 


0 

0 

1 

0 

I 

k 2 

P， J 



0 

0 

-1 ) 


(0 

0 

0 

-1 J 




( 0 

Pi 

V 

Ki 


0 k x \ 

, Ko = 

( c 

1 

M 

, 

V P2 

0 

) 


—*1 

0 / 



V -*2 

0 ) 



PiP2 + kik2 = 1. 


If 


I Pi~ ^ -^l j 

V k 2 P2- X ) 

has only simple elementary divisors, we have a, b, c, d such that 
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0 AO 0 \ 

1/A 0 0 0 

0 0 0 1/A 

< 0 0 A 0 / 

(notice that the transformation carries Ji into —5i, but they denote the same t 
formation). Further 

, 1 0 0 0 \ 

0 A 0 0 

0 0 10 

\ 0 0 0 1/A > 

carries 5 into 52. If 



has a double elementary divisor, we may take 


V *2 



Now we have to consider the case 
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niBaBwjwMHBy 

: </'/：*；?*j - ->v 'yi\ 7 /i\ 




has two isolated fixed points with arithmetic 



points, with arithmetic distance 2 , will serve 
•lution of the second kind, which is uniquely 
i oo be fixed points, then the transformation 


[ZA\ A 2 = 1. 








土 /， 







then 0 would not be an isolated fixed point. 


them with arithmetic distance less than 2 , are said to form a harmonic range, if the 
involution determined by Z\, Z^ permutes Z 3 and Z 4 . 

Analytically, we let 

(Zi, z 2 , z 3 , Z 4 ) = ((Zi - Z 3 )(Z! - Z^r l ){(Zi - Z 3 )(Z 2 - Z t )- l )~\ 



(Z- Z{)(Z- Z 2 )~ l = -(z •- Z X )(Z* - 為 )- 1 







carries into Z 4 . Evidently 


{Z\, Z 2 , Z 3 , Za) — -I. 

This condition is sufficient as well as necessary. 

Thus the "invariance of arithmetic distances" implies the invariance of har¬ 
monic range”. 
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1. Introduction 

Elsewhere® the author established the symplectic classification of Hermitian ma¬ 
trices, which has applications to the geometry of symmetric matrices. It is the purpose 
of the present paper to treat the analogous problem： the orthogonal classification of 
Hermitian matrices. In other words, it may also be described as the quasi-unitary 
classification of symmetric matrices. Besides their own interest, the results of the 
present paper have applications in the geometries of skew-symmetric and symmetric 
matrices, as well as in the theory of automorphic functions of a matrix variable. 

It should be noted that the method previously used is only applicable to the 
present problem for matrices of even order. In order to establish the general solution., 
we introduce here a different method. 

Unless the contrary is stated, throughout the paper, capital latin letters denote 
n-TOwed square matrices. Further let M, and M respectively denote the transposed 
matrix and conjugate imaginary matrix of M. I denotes the identity matrix. 

2. Statement of the problems 

Two Hermitian matrices H and K are said to be conjunctive orthogonally, if 
there is an orthogonal matrix P such that 

PHP 1 = K. (1) 

FVom (1) and the orthogonality of P we deduce immediately that 

PH HP' = KK. (2) 

Therefore, if two Hermitian matrices H and K are conjunctive orthogonally, the 
elementary divisors of the characteristic matrices of HH and KK are the same . 

In the converse, two problems arise: 

1. Does HH take any prescribed elementary divisors? The answer is in the 
negative. Then we ask further: 

* Presented to the Society, February 23, 1946; received by the editors, October 18, 1945. 
Reprinted from Transactions of the American Mathematical Society, 1946, 59(3): 508-523. 

① Amer. J. Math., 1944, 66: 531-563 
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2. For a given admissible set of elementary divisors, is there a unique H, apart 
from orthogonal conjunctiveness, such that HH takes the given set as its elementary 
divisors? 

The answer is also in the negative. More definitely, the elementary divisors of 
~HH characterize the orthogonal conjunctiveness of H with a reservation about the 
uncertainty of signs related to “signature” • 

More generally, apparently, we have the following problem. Two pairs (H, S) and 
(Hi,Si) of matrices, where H and Hi are Hermitian and S and S\ are symmetric, 
are said to be equivalent if there exists a nonsingular matrix P such that 


~PHP , -- 

=H X 

(3) 

and 



PSP 1 = 

= Si- 

⑷ 

Consequently, we have 



phs^hs^T- 1 ： 


⑹ 


if Sis nonsingular. Hereafter we discuss only the case with nonsingular 5. Analogously, 
we have also two problems. 

3. Solution of problem 1 

Theorem 1 Let T be a symmetric matrix. The equation 

~SH = T ( 6 ) 

is soluble in an Hermitian H, if and only if the elementary divisors of T have the 
following two properties: 

(i) Each elementary divisor corresponding to a negative characteristic root must 
occur an even number of times; 

(ii) Complex elementary divisors must occur only in complex conjugate pairs. 
Analogously, we have 

Theorem 1’ The equation 

HS- 1 1TS^ = Q (7) 

is soluble in symmetric S and Hermitian H if and only if the elementary divisors of 
Q satisfy (i) and (ii) of Theorem 1. 
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Equivalence of both theorems. If S~ l = IT' in (7)，we have 


(^Hr^rUT) = r'Q^)- 1 . ⑻ 

Thus, Theorem 1 ; implies Theorem 1. Conversely, since we have F such that rQ ^) -1 
= T, Theorem 1 also implies Theorem 1’. 

The sole difficulty lies in proving the property (i). 

Proof of the theorems (1) Suppose that 



where q\ and have no characteristic root in common. Let 


⑼ 


H 


hi 



( 10 ) 


From ( 6 ), (9) and (10), we have 

h x h\ + kk' = qi, ( 11 ) 

hik + kh 2 = 0 (12) 

and 

k'k + A 2/12 = 92 - (13) 

FVom (11), (12) and (13), we deduce immediately 

qik = h\hik 4 - kk'k = —hikh ，2 + kk'k = k{hzh 2 + k'k) = kq 2 - 

Since 9 i and qi have no characteristic root in common, we have A: = 0. Thus ( 6 ) 
is soluble if and only if both equations 


hihi = qi, hiht = ga 


are soluble in Hermitian hi and A 2 - Thus we have to consider only either that T has 
a real root or that T has a pair of conjugate imaginary roots. 

(2) Now we introduce the notations: 


jM 


0 0 -0 1 
0 0--1 0 


=(o«). 


\ 1 0-0 0 / 
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where anj = lifi + j= p + l and 0 otherwise. Let be the ordinary Jordan's 
canonical form ① of a matrix with elementary divisor (x— a) p . Then jj a is symmetric. 
It is well known that there exists a real polynomial f(x) such that 

/ 0 i ) 2 = Ji- 

Let s = o^^jfiji) which is symmetric, since 

jjt=j[33r l =-=3T3- 
It can be verified easily that s satisfies 

sjs = ajji. (14) 

(3) Now we consider the case thet T has only a pair of conjugate imaginary 
characteristic roots. Evidently the elementary divisors of HH — XI and HH — XI are 
the same; the solvability, therefore, implies the property (ii) of the theorem. 

Conversely, the pair of symmetric matrices (/,T) is congruent to a pair of ma¬ 
trices (Ti, Ta) which is a direct sum of the following pairs: 

(j 0 ) ( ajji 0 ) 

、 0 j . J 〈 0 初 ! 人 

Let 

rr # = Ti, irr 7 = t 2 . 

Then, (6) implies 

(THr , ) = T 2 (15) 

and conversely. 

Since Ti is real and = /, the solvability of (6) is equivalent to that of 
HTiH = T 2 . 


Since the equation 

( 0 s 1 ( j o 1 ( 0 s U 咖 0 \ 

( 吞 0 八 0 j ■ 八 5 0 厂 、 0 ajji ) 


mple, ii 2) = ^ q 
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is equivalent to sja = ajju which is soluble in s by (14), the theorem is proved for 
the case with complex characteristic roots. 

(4) The easiest case is that in which T has a positive root. Since we may assume 
that (TijTj) of (3) is a direct sum of 


(J»_i)，a > 0, 

by (14), there exists a real s such that 

sjs = ajji 


and the theorem is now proved. 

(5) Finally, there comes the difficult case that a is negative. Without loss of 
generality, we may assume that a = —1. 

Since, by (14), we have a symmetric matrix s such that 

f 0 3 \ ( j 0\ f 0 0 \ 

\s 0 ) \ 0 j ) \ s 0 )~{ 0 -jjx ) 5 

we see that if (i) is satisfied, (7) is soluble. The sole difficulty lieB in the converse. 

(6) Now we write 


T 0 =/**>+•• (16) 

.•斗 (17) 

Let K = HTo, then the equation 

HT 1 H = T 0 (18) 



KTqTiK = I. (19) 

From (19), we deduce immediately that 

KToTi = KToTjCTo^K = T 0 T X K. (20) 

Let 

K = (k ij ), ^ = k^' ai) . 

FVom (20), we deduce 

( 21 ) 
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It is known that 

f aij bij Cij …、 

kij = 0 仏 j ， if = (22) 

V.°*j / 

and in case aj > s/t, we add Sj — Sk rows of zeros below the matrix of the form (21) 
and in case Sj < s*，we add a* — Sj columns of zeros before the matrix of the form 
( 21 ). 

FVom (19)，we find 

3 

Now we consider the element at the (1,1) position of the case Si = 9k- By (21), 

= S ik I^"l (23) 

By the constitution of the first column of j-\, we need only consider the element at 
the (1,1) position of kijkjk- It equals zero, if < a *： and if Sj > s*. Thus the sum 
(23) runs only over all those j's with 8j = s*. 

Let s be any integer occurring in the set «!,••• ,s e . Without loss of generality, 
we may assume that 

Si = a for 1 ^ t < m, Si ^ s for m < i 彡 e. 

Then, from (23), we deduce 

— ^ OijCijk — Sik, 1 彡 i, fc 彡 m, 

片 i 

that is AA = -/("») if we let A = Taking determinants of both sides, 

we have |d(^)| 2 = (—l) m , which can not hold for odd m. 

The theorem is now proved. 

4. Square root of an orthogonal matrix 

Now we require a result very likely due to Hilton, but his paper is not available 
here. Accordingly we state the rediscovered result in the following without proof. 

We say a matrix P is orthogonal with respect to a nonsingular symmetric matrix 

5, if 

PSP，= S. 
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rticular, for 5 = /, we omit the phrase “with respect to P. 


orthogonally, if there exists a matrix P orthogonal with respect to S such that 4 = 
PBP~\ 

Theorem 2 (Hilton) Every orthogonal matrix is similar orthogonally to a direct 
sum of orthogonal matrices with elementary divisors either of the form 

(i) (x - a) r , (x - l/a) p , 
or of the form 

(ii) (x ± l) r , for odd r. 

For the first case, the matrix orthogonal with respect to 

D 

is similar to the matrix 


orthogonally, and for the second case, the matrix orthogonal with respect to 



is similar to 



U^lY o o 

v 10 

- 傲 -w u^lr 1 


(25) 


orthogonally, where v=(0,0,-• - , 0 ， l)and 2p+l = r. 

Evidently, the squares of the matrices M and N are matrices Mi and Ni, or¬ 
thogonal with respect to S and Si, with elementary divisors 
(i ; ) (x- a 3 ) r , (x- a~ 2 ) r 

and 

(ii') (x + l) r , 

respectively. 

With a slight modification we have the following assertion: for given orthogonal 
Mi and N\ with elementary divisors (i') and (ii’ ） respectively, we can find orthogonal 








ces M and N with the elementary divisors (i) and (ii) respectively and with 
the right half-plane in the plane of complex numbers. Thus, except when a is 
ive, an orthogonal matrix has a square root matrix. 

rheorem 3 Let R be an orthogonal matrix without a negative root and suppose 
ue have an Hermitian matrix H such that 


RH = H 节 . 


(26) 




there exists an orthogonal matrix Q with characteristic roots on the right half- 
such that Q 2 = R and 

QH = H^. (27) 


We may assume that A is a direct sum of the matrices 


i2 = ri + r 2 + ••• -i-r e , 


where Ti has elementary divisors either of the form (i’）or of the form (ii 7 ). They are 
denoted typically by Mi and Ni respectively. We construct M and N accordingly; 
they are denoted by ?i, ••- , q e . Let 

Q = 9l+®+* - •+9e- 

Then, evidently, we have Q 2 = R. 

Further, from 

RH = H^, 

we have 

rihij = hijf'j, (28) 

and if this implies 

Qihij = (29) 

we have QH = HQ' and consequently we have the theorem. 

In order to verify that (28) implies (29), we need only verify that 

M 2 X = XM 2 and N 2 X = XN 2 


implies 


MX = XM and NX = XN, 


respectively, where M and N are defined exactly as in (24) and (25). Straightforward 
calculation establishes both implications. 
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5. Solution of the second problem 

Theorem 4 If HH = KK, where H and K are both Hermitian, then there 
exist two matrices P and Q such that 

: f ^ ° 1 

VO h2 ) 


二 )_ 

Proof Since K~ l H — R is orthogonal, we have an orthogonal matrix T such 

that 

r_Isr =(o 

where r*i contains all negative characteristic roots of /Zi. Then, we have 
H! = 

where Hi = HT, Ki = KT are both Hermitian. Since rj and fa have no 


characteristic root in common, we have 
^1 = ( 


ifci 0 
0 k 2 


and 


丑 i = 


hi 0 
0 


and 


kiri = f^ki 


kiT2 = r^fc2. 

By Theorem 3, we have an orthogonal 92 such that = r i 

hi = k2<^ = 名 ^ 292 . 

Further we have an orthogonal matrix gi such that g? = —n and 
hi = -kiq\ = -^ikiqi. 

The theorem follows. 
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6. Explicit result 

The result previously obtained may be concluded in the following theorem. 
Theorem 5 Let H be a nonsingular Hermitian matrix and S be a nonsingular 
symmetrix matrix. The elementary divisors of the characteristic matrix of 


HSU 1 


(30) 


are of the following three types: 

(i) (x — a)^, for real a > 0; 

(ii) (x - 0) x , (x - 0) x , for real 0 <0 

and 

(iii) (x — <y) 入， (x — 7) a , for complex 7. 

Let ai = a 1 ’ 2 ， i(—/3) 1 〆 2 and 71 = 7 1 / 2 ; the determination is taken on the 
right half-plane. Then the pair of matrices (H,S) is equivalent to ① 




(31) 


In the theorem we have to justify only a point that the pair of matrices 

is equivalent to 

Hi t )-(; :))■ 

In fact ,we have a nonsingular matrix 7 such that = — jj^and 7 j 7 , = j. Let 



p U 


① Y2' denotes direct sum. 
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and 



The situation of the indeterminate signs corresponding to a positive root or to 
a pair of conjugate complex roots will be clarified by introducing the concept of 
signature system. 

Suppose that has an elementary divisor {x — a) x (a > 0) repeated 

m times. Let p and q be the number of positive signs and negative signs appearing 
in the expression (31) corresponding to the elementary divisor (x - a) x . Then (p, q), 
p + q = m, is called the signature corresponding to the elementary divisor (x — o ： ) 入 . 
Similarly we define the signature corresponding to 

(a:-7) A (*-7) A 

for complex 7 . The totality of elementary divisors and their corresponding signatures 
is called the elementary divisors with signature system of the pair of matrices (H, S). 
Evidently, (H, S) and {Hi, Si) are equivalent, if they have the same elementary divi¬ 
sors with the same signature system. The converse of this statement is also true. The 
proof of this fact can be constructed by adapting the results to be obtained in §§ 8-10 
and the method used in ( 6 ) of §3. Owing to the similarity, we give here no details of 
the proof. 

7. Anti-involutions 

An orthogonal matrix T satisfying 

TT = I (32) 

is called an anti-involution of the first kind, and that satisfying 

TT=-I (33) 

is called an anti-involution of the second kind. 

Since TT' = I, it is evident that the involution T of the first kind is an Hermitian 
matrix. By Theorem 5, we have an orthogonal matrix P such that 

尸 7^ = [1 ， ... ，1， -1 ， .- ， -1 】 

which is a diagonal matrix. 
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In case of the second kind, iT is Hermitian, the roots of 

|(ir)(i?) - a/| = o 


are all equal to —1. The case can only happen for n even. There exists an orthogonal 
matrix P such that 

0 i/W \ 

0 J ' 


P(iT)P = 


where n = 2p. 

8. Automorphs 

Now we are going to study the group formed by all the matrices P satisfying 

PSP 1 = S (34) 

and 

PHP 1 = H. (35) 

By Theorem 5, we may write 

s = s^+si q \ 
h = h\ p Hh^\ p+q = n ' 

where the two matrices 

and IhSn'S; 1 

have no characteristic root in common. FVom (34) and (35)，we have 

PHS~ l = HS~ l P. (36) 

It follows immediately that P is decomposable as 
P = 


f 0 . 
、 0 中 . 


Therefore, without loss of generality, we treat the problem with (i) HS -1 !! S 
having only a pair of conjugate roots, (ii) HS -1 !! S having only a positive root, 
and (iii) HS - 1 l?S -1 having only a negative root. 

We shall discuss the cases separately. 
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9. The case with a pair of complex roots 


Now, without loss of generality, we take 


5 = 

l<t<e \ 


j ⑷ 

0 



and 


E ,=ti 


0 



where > «2 > • • • ^ Putting 

j m = 

l<i$e 


and 

4 P) = 乙'(幻4**))， 

we treat, without loss of generality, the case that 



and so on. Then, from (36)，we have 


(A 


f 0 JyJ \ 

0 J,J\ 


U 


o )" 

—U 7 7 o j 

\ c D ) 


that is 


B^-fJ = J-yJC, AJyJ = J-fJD, 
DJ^J = JyJA,CJyJ = JyJB. 


(37) 


Then, we have 

, B{J^J) 2 = (J 7 J)C(J 7 J) = (J,J) a B. 

It follows that B = 0 since the characteristic root of (J 7 J) 2 is equal to 7 2 and 7 2 ^ 7 2 . 
Consequently (7 = 0. 

Further, from (37), we have 


A(J-,J) 2 = (J y J) 2 A 


(38) 
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and 

D= (J 7 J) _1 ]4(J 7 J). 


EVom (34) we deduce 


AJA' = J. 


Conversely, for any A satisfying (38) and (40), the matrix 


P 



A 

0 




satisfies our requirement. 

Then we need to find those A satisfying both (38) and (40). 

Let 

A = (I + Q)(I-Q)- 1 

(the exceptional case |/ + A| = 0 is negligible in counting of parameters). 
From (38) and (40), we have 


Q(7 7 J) 2 = (J^JfQ 

and 

QJ + JQ' = 0. 

Let QJ = K, which is a skew symmetric matrix and which satisfies 
K{JJ^) 2 = 

(notice that JJ~,J = Jy). 

Putting 

K = fcii= *4”） 

and 

kij = 


(39) 

(40) 


(41) 


(42) 


we have 
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It follows immediately that 


*i x 2 ■■- ar Sj _, x aj 
X 2 x 3 ••- x Mj 0 


hj 


x aj 0 
0 0 


0 

0 


which contains 内 rows, if Sj < with addition of a sufficient number of zero rows, 
and which contains 2sj parameters (a complex number is counted as two parameters). 
Further in case i = j evidently ku equals zero. 

Therefore K depends on 


2s 2 + 2 s 3 + • • • +2s e 

+2*3 + … +2» e 


+2s e 

= 2s2 + 4s 3 + • • • +2(e - l)s e 


parameters. 

10. The case with a positive root 


Now we take 


s= 

l<*<e 

H= [ (士 jW )， 

i<i< e 


where si ^ S 2 ^ ^ s e . Let 


a > 0, 


P = (J + Q)(/-Q)- 1 . 
We find, from (34) and (35), that 

QS + SQ' =0 


and 


QH + HQ' = 0. 
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Putting QS = K, which is skew symmetric, we have 
KS~ l H - HS^K = 0. 
Since a ^ —a, we find that K is real. Let 


(43) 


K = {k ij ), kij = kj^. 


= (JW 

In a manner similar to that of §9, it can be shown that the number of parameters is 
equal to 

2s 2 + 4*3 + • • ■ + 2(e - l)s e . 

11. The case with a negative root 
Now we may let 

uO )， 




where /3 > 0. Let 


: E ? ⑹， J 0= H Up- 

l«e K«<e 

We may consider, without loss of generality, the case 


J 0 
‘ 0 J 


H = 


0 J p 
~ J 0 0 , 


Letting 


and 


P = (/ + Q)(/-Q )- 1 

. O' 


which is skew symmetric, we find, from (36), that 


K\ 


0 JJ 0 
-JJp 0 / 


0 J 0 J 
-J 0 J o t 


(44) 
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Let 


k=[ k \ l 

(45) 

V -L' K 2 ) 

Prom (44), we have 


LJ3^ — J 玲 JL ， L J — J 卩 JL， 

(46) 

KiJJff = JpJK 2 , K 2 JJ 0 = J/ 3 JK 1 . 

(47) 

Putting ~LJJp = T, which is Hermitian by (46), we have 


T(JJ 0 ) 2 = 

(48) 

K^JJp ) 2 = {JpJfK, 

(49) 

and 


K 2 = 

(50) 


Consequently, for T and K\ satisfying (48) and (49) and defined by (50), we 
have K defined by (45) to meet our requirement. 

The number of parameters of Ki is equal to 


282 + 4s 3 + ■■■ + 2(e — l)s e 
and the number of T is equal to 

2 S 2 + 4»3 + • • • + 2(e — l)s e + Sl + S 2 + • • • + Se. 

The total number of parameters of K is, therefore, equal to 
+ 5«2 + 9^3 +. •. + (4e — 3)* e - 
12. Automorphs (continuation) 

As a consequence of the previous results we have the following statement: let 
the roots of HS -、 H 5 1 be 

ai, ••- ,ox (a > 0) 

with the multiplicities 

Pi, … .P a ； 

0u ... 自 1 ，…，心 complex) 
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with the multiplicities 


and 


with the multiplicities 


9i .• - .9p； '' .9/* 

7i .• - .7i/ (7 < 0) 
2j*i ，• ■ ■ , 2 r„ 


respectively. Then the group of automorphs of {H,S) depends on at least 


n + ^2 H - \-r u 


(51) 


parameters (evidently, this is a best possible constant). 

Theorem 6 Given A and an Hermitian matrix H, the symmetric matrices S 
satisfying 

-XJ) = 0 (52) 

depends on n(n + 1)-1, n(n + 1)-2, and n(n + 1)-3 parameters according as X is 
positive, complex, and negative respectively. 

Proof The group of conjunctive automorphs of H, that is, the group formed 
by T satisfying 

THT' = H, 

depends on n 2 parameters. Now we ask what is the number of parameters of distinct 

rsr'. 


Since the matrix T satisfying 

rnr , = h, tst' = s 

depends on not more than n H - h r„ parameters, if the roots of HS~ l H S ^re 

given at the beginning of the section, the totality of different symmetric matrices 
TST' depends on n 2 - ri — - r v parameters. 

For a given positive A, the manifold formed by S (varying all the other roots) 
depends on n 2 4- (n -1) parameters. Similarly we have the result for a complex root. 

For a given negative root, the number of parameters of other roots is equal to 
n-2. Farther the different symmetric r < ST , , for all roots being given,depends on at 
most n 2 - 1 parameters (in case all the others are non-negative n 2 - 1 is the exact 
number). Thus the total number of parameters is equal to n 2 — n — 3. 

Finally, the author wishes to express his warmest thanks to the referee for his 
help with the manuscript. 








Geometries of matrices. II. Study of involutions in 
the geometry of symmetric matrices* 

1. Introduction 

The paper contains a detailed study of the involutions in the geometry of sym¬ 
metric matrices over the complex field. It is one of the aims of the paper to establish 
the following theorem: 

A topological automorphism of the group formed by the symplectic transforma¬ 
tions is either an inner automorphism or an anti-aymplectic transformation. 

More precisely, we identify two symplectic matrices T and —I as a symplectic 

transformation To- A continuous automorphism of the group formed by To is either 

of the form ^3oTo^Po 1 or 屯 where ^Jo denotes a symplectic transformation 
and Tq i 8 the conjugate complex of Xo- 

The following result, which can also be derived from Mohr’s results® on the 
representations of the symplectic group, can be obtained as an immediate consequence 
of our present theorem: Every topological automorphism of the group formed by 
all symplectic matrices (that is, we do not identify 1 and —I) is either an inner 
automorphism or the conjugate complex of an inner automorphism. Actually, by 
means of the method used in the paper, an independent proof of this result can be 
obtained which is much simpler than that of the first theorem, since the distinction 
between involutions of the first and the second kind now is apparent. 

In the course of our discussion, we find the explicit normal forms of involutions 
and anti-involutions. The manifold of the fixed points of all sorts of involutions has 
also been determined completely. 

As an introduction, several types of geometries keeping an involution or an anti¬ 
involution as absolute have been enumerated. Those obtained from anti-involutions 
are generalizations of non-EucIidean geometries and those obtained from involutions 
give us several new types of geometries, whose real analogy (which will be given 

* Presented to the Society, December 29, 1946; received by the editors February 27, 1946. 
Reprinted from Transactions of the American Mathematical Society, 1947, 01: 193-228. 

① Gottingen Dissertation, 1933. The author is indebted to the referee for this reference, but 
unfortunately it is not avEdlable in China. 
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elsewhere later) is a generalization of Mobius geometry of circles. 

Furthermore, the author shows that every symplectic transformation is a product 
of two involutions and four anti-involutions, and that for n = 2 <t t, t odd, in the space 
of symmetric matrices of order n, we have at most it + 3 pairs of points of which any 
two pairs separate each other harmonically. 

Algebraically speaking, the last result is equivalent to the following one: let 
Xi, be symplectic matrices of order 2n satisfying 

1 2 i = -D, TiXj = -Zjli, 

where 3 denotes the 2n-rowed identity. Then s < cr+3 and this maximum is attained. 
As a by-product the author establishes also that if 5i, ••- , 5, are n-rowed symmetric 
matrices satisfying 5? = -/, SiSj = -SjSi, then s < a+l. In case of skew symmetric 
matrices, we have s < cr. These maximums are all attained. 

As in I ①’ capital latin letters denote nxn matrices unless the contrary is stated. 
On the other hand, we use Af(*， m ) to denote an / x m matrix and = M( TO ’ m >. 
/ and 0 denote the identity and zero matrices respectively. We use also 



which are 2n-rowed matrices, p and q denote two integers satisfying p + q = n. 

2 . Classification of involutions 

First of all we identify the transformations which have the same effect in the 
space of symmetric matrices. 

Theorem 1 In the space of symmetric matrices (in homogeneous coordinates), 
two substitutions 



⑴ 

and 


Qo(Wi,W 2 ) = (Z 1 ,Z 2 )5o 

(2) 

induce the same mapping of the space, if and only ij 


X = ±So- 

(3) 


① The first paper of the series will be referred to as I (Trans. Amer. Math. Soc., 1945, 57: 
441-481). 
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In particular, (1) carries every point of the space into itself, if and only if 

I = ±3. 

Proof It is sufficient to establish the second part of the theorem. Putting 
Zi = 0, we have, by the supposition, Wi = 0. Hence C = 0, if we put 


Similarly, putting ^ = 0, we find B = 0. Now the transformation becomes 
W = AZA' 

in the nonhomogeneons coordinate system. By the assumption, 

Z = AZA! 

holds for all Z, consequently A = ±1. 

From Theorem 1, we deduce at once the following theorem. 

Theorem 2 A symplectic transformation 

Q(W 1 ,W 2 ) = (Z 1 ,Z 2 )% t=^ B D ^j 

is an involution if and only if 


(A 

B \ 

_ ± ( D' 


u 


V - c， 

^ J 


that is, *13 is either skew symmetric or symmetric according os X 2 = 3 or -3. 

By an involution, we understand a symplectic transformation whose square 
induces the identity mapping. Similarly, we define an anti-involution as an anti- 
symplectic mapping whose square induces the identity mapping. 

Theorem 3 An anti-symplectic transformation 

Q{W 1 ,W 2 ) = {Z i ,Z 2 )% 1= ^ ^ ⑺ 

is an anti-involution, if and only if 


(A 

b y 

’一土 y 

- b， \ 

U 



A' J 
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that is, %'S is either a skew Hermitian or an Hermitian matrix according as IX* = 3 
or —3_ 

Proof FVom (7) and 

Q'(Z 1 ,Z 2 ) = (P 1 ,P 2 )1, ⑼ 

we have 

Q*QW,iv 2 ) = (p 1} p 2 )*rx. 

This represents the identical mapping, if and only if = 士 3. Further, since % is 
symplectic, that is = S, we have 

T5 = T(1S^) = ±5^ =干 CTff)' 

The theorem is now evident. 

Definition 1 The involutions satisfying I 2 = 3 are called involutions of the 
first kind and those satisfying X 2 = —3 are called involutions of the second kind. 

Definition 2 The anti-involutions satisfying XT* = 3 are called anti-involu¬ 
tions of the first kind and those satisfying IT* = —3 are called anti-involutions of the 
second kind. 

3. Normal form of involutions 


Suppose that T is an involution of the first kind. By Theorem 2, is skew 
symmetric. Now we consider the pair of skew symmetric matrices The 

elementary divisors of the matrix — A5 = (!I — tire those of X — A3. Since 





he transformation 


W = HZH. 


( 10 ) 








B 


have the following theorem. 

m 4 Every involution of the first kind is equivalent to (10) symplecti- 
is a diagonal matrix with p positive Is’ and q negative Is' and p ^ q. 
wo of these involutions are equivalent. 


The last sentence can be justified by considering the multiplicity of the charac- 



L of the symplectic matrix. 

ion 3 This involution is said to be of signature (p, q). 


Now we consider an involution X of the second kind. Then we have a pair of 
matrices the first matrix is symmetric and the second skew symmetric. The 

characteristic roots of 1 are ±i and the elementary divisors are all simple. Let® 


d(<r-A3) = /(A). Since Tff = we have 


/(A) = d(T - A 3 )d(ff) = diyi '- 1 - A 3 ) 

=叶普俨/⑴. 

Hence the multiplicities of the characteristic roots i and —i are equal. 
Since 



has the same elementary divisors as X5 — AJ, we have a symplectic matrix ?p carrying 
T into 



which corresponds to the transformation 


W = -Z~ 


( 11 ) 


Hence, we have the following theorem. 

Theorem 5 Every involution of the second kind is equivalent to (11) symplec- 
tically. 

Remarks (1) For n = 1, no involution of the first kind exists. 

① d(X) denotes the determinant of the matrix X. 
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(2) The following normal form of an involution of the second kind is sometimes 
useful: 

W = -Z. (12) 


(3) In the case n = 2p, we sometimes use 

-( ； . : T) 


(13) 


as the normal form of an involution of the signature (p,p). 

Evidently, two simplectic transformations with matrices and I 2 are commu¬ 
tative if and only if 

XiT 2 = ±^ 1 - (14) 


Consequently, we see that the product of two commutative involutions Xi and X 2 of 
the fiist kind is an involution either of the first kind or of the second kind according 
as X 1 I 2 = X 2 ^i or T 1 X 3 = -^ 2 ^ 1 - In particular, for n even, n = 2 p, say, (11) may 
be regarded as a product of two involutions of the first kind: 

7 ：>0 T ) 



For n odd, such a decomposition does not exist. 

Further, any involution of the first kind is a product of involutions of the first 
kind of the signature (l,n - 1), which are called fundamental involutions. 

4. Equivalence of anti-involutions 

A little attention should be paid to the equivalence of anti-symplectic trans¬ 
formations. Let {Wi,W2) = Q(Z\, - FVom {UuU-i) = Qi(Zi,Z2)^i,(Vi,Vz)= 
Q 2 {W u W 2 )% we have 

{y x ,Vi) = Q 2 0^， 的冲 =Q 2 Q (玄 1 ，為)邱 = Q2QQT l (Ui,U 2 )V^V- 


Thus we have the following definition. 

Definition Two anti-symplectic transformations with matrices and .%2 o.re 
said to be equivalent, if there exists a symplectic matrix V such that 


= I 2 . 


(15) 
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Notice that if (JTi，％) and {Zi,Z^) (and (Vi, V^) and {W\,W 2 )) are related anti- 
symplectically, then we have = I 2 instead of (15). 

5. Relation between anti-involutions and hypercircles and the normal form 
of the anti-involution 

Let ft be an Hermitian matrix. The points (Zi, Z-i) of the space for which 

(Z^Z^ZuZi)' (16) 

is positive definite form a hypercircle, ft is called the matrix of the hypercircle. The 
skew matrix 

(17) 

is called the discriminemtal matrix of Sj. 

For an anti-involution of the first kind %, we have a hypercircle with the matrix 

(18) 

in fact, since T is symplectic, and for an anti¬ 

involution of the second kind %, we have a hypercircle 

Iff. (19) 

Their discriminantal matrices are, respectively, 

(tX5/5(tT5)=5 3 = -5 

and 

(W 卿) = -沪 = 

Theorem 6 To each hypercircle with discriminantal matrix —5 or there 

corresponds an anti-involution of the first and the second kind respectively. 

Proof (1) Let Sj be an Hermitian matrix satisfying = —5. Let 

1 = ifjd- 


Then 

and 


l , yX = SSj'Sf)S = -S 3 = S 

T%= -ff 2 = 3 . 
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Hence 7 is a symplectic matrix and we have an anti-involution of the first kind. 
(2) If 力 ’ 肋 =set X = —Sj^. We obtain consequently 

yyr = -肋'肪 5 = -5 3 = 5, 

% te Z = S)^ = -z, 

that is, X defines an anti-involution of the second kind. 

Further, from ?p*T?p -1 = Zi, we have 


Therefore, in order to classify anti-involutions, we have to classify their corresponding 
hypercircles. 

Since 

_ Aff = 士衣 一入 S 

has characteristic roots either all +1 of all —1 and has simple elementary divisors, we 
have ① a symplectic matrix ^3 such that 


平 • 邱 , = 




where H\ is a diagonal matrix with p terms 1 and q terms -1. 
Since the discriminantal matrix is 士 then we have 


H[H 2 = -I, H' 2 Hx = -I 


for anti-involutions of the first kind, and 

H[H 2 = I, = I 


for anti-involutions of the second kind. 


Thus, for anti-involutions of the first kind, the hypercircle is symplectically 
junctive to® 



con- 


( 20 ) 


① See the author's paper On the theory of automorphic functions of a matrix variable, II, 
Amer. J. Math., 1944, 66: 531-563. 

® Two Hermitian matrices H and K are said to be conjunctive, if there exists a matrix A such 
that All A' = K. 
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and for the second kind, it is symplectically conjunctive to 

(o' O' 

Therefore, we have the following theorem. 

Theorem 7 An anti-involution of the first kind is equivalent to 

W = Z _1 


( 21 ) 


( 22 ) 


and an anti-involution of the second kind is equivalent to 

W = -HZ~ l H, (23) 

where H denotes a diagonal matrix with p terms 1 and q terms —1, pK ： Q. 

We may also prove that they are all non-equivalent, since their hypercircles are 
non-equivalent. 

An anti-involution, equivalent to (23), is called an anti-involution of signature 
(P ， 9)- 

Remark (22) is equivalent to 

W^-Z. (24) 


In the case p = q, (23) is equivalent to 



6 . Decomposition of involutions into anti-involutions 


(25) 


Now we are going to express involution as product of anti-involutions. 

Theorem 8 Every involution is a product of two commutative anti-involutions 
of the first kind. 

Before proving Theorem 8, we give the following rules concerning the multipli¬ 
cation of anti-symplectic transformations : 

(1) The product of two anti-symplectic transformations with matrices Ii and I2 
is a symplectic transformation with matrix T*X2 ； 



Then 

^' = (o h)=^ 2 ' 

which corresponds to the involution W = HZH of the first kind. Further = 
*1找2 = 3, hence 1i and I 2 represent two anti-involutions of the first kind. 

(2) Let 



Then = 3 - Further 

which corresponds to the involution Z\ = —Z~ l of the second kind. 

Theorem 9 Every anti-involution of the second kind is a product of three 
mutually commutative anti-involutions of the first kind. 

Proof The anti-involution of the second kind 

Zi = -HZ~ l H 

is a product of Z\ = —and Z 2 = HZH, and the former one is a product of two 
anti-involutions of the first kind: 

Z\ = —Z 3 , Z 3 = Z^ 1 - 


The three anti-involutions so obtained are evidently mutually commutative. 

Remark Theorems 8 and 9 (and later Theorem 10) suggest that anti-involu¬ 
tions of the first kind can be used as generators in the group of symplectic and anti- 
symplectic transformations. Does the anti-involution of the second kind play the 
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Now, we are going to show that, in particular, for = 5 we have no I 2 satisfying 
this condition (notice that no generality is lost). From 


i 2 y=-3^ 

and (by Theorem 3) Z 2 S = ( 玄 2®)*’ = —3^3’ ， we deduce that ^2 i 8 symmetric. 
Further, the equation 

X2 究 , == -3 


is impossible, since the matrix on the left is a positive definite Hermitian matrix, 
while the matrix on the right is negative definite. 


7. Decomposition of symplectic transformations into involutions 

First of all we give the normal form of a symplectic matrix 1. We have a sym¬ 
plectic matrix 识 such that <p -1 15p is a direct sum of symplectic transformations of 
the forms 

+ 5 (<T) (27) 


W {r) = a#±l,r odd, 


(28) 


4 T) - 


^ 0 0 ••- a ) ^ 0 0 ••- 0 

By a direct sum of two symplectic transformations with matrices 


( Bi 

Ci Di 


M 


A T) b 2 ' 

C2 


we mean a symplectic transformation with the 2 (ct + t)- rowed matrix 



This result, according to valuable information from Professor H. Weyl, is due to 
Williamson; however, it was also proved by the author independently. Since William- 



son’s paper is not available in China, the author is obliged to give the preceding result 
without a necessary quotation. 

Theorem 10 Every symplectic transformation is a product of two involutions 
of the second kind; consequently, every symplectic transformation is a product of four 
anti-involutions of the first kind. 

Proof (1) Let 


Evidently , 屯 is an involution of the second kind. Let 


•4 0、 

o d 


卿 )2= (U) - 3 ’ 

since J a J = JJ' a . Thus X is a product of two involutions of the second kind. 
(2) Next we consider the case 


M (n) = M = (mij), mij = 


and M 2 = (Uj). Then, by (30), 

Uj = 53 ; ( _ i)* -1 ( 

h=l ' 

Therefore, we have 




0, for i ^ j, 
1, for i = j. 


Consequently, the matrix 




iM 0 
0 -iM' 
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denotes an involution of the second kind. We may verify directly that 


JiM (n) = 


0 

-M( n_1 ) 


Thus 


邱 = 


'JiM \ 

0 ) 

is also an involution of the second kind, since S — 

(3) Similarly we treat the case 


(J-i 5J11 1 \ 

V 0 ^1* ) 


On account of the result quoted at the beginning of the section, we have the 
theorem. 


8. Geometries induced by anti-involutions 

The results of Sections 8, 9, 10, 11 and 12 can be used to introduce several types 
of geometry; the detailed study will be given elsewhere. 

We take an anti-involution 3 of the first kind as an absolute. The group G 
formed by all symplectic transformations T commutative with 3 is called the group 
of motion. Correspondingly, we have a hypercircle with the matrix QS- The group 
of transformations Z with X3 = 31* form a subgroup G\ whose index in G is equal 
to 2. 

In fact, from 

= 3TI 

and 

T 2 3 = 

we deduce H23 = = - 

As the transformation 


of G\ carries 


(W 1 ,]V 2 ) = Q(Z 1 ,Z 2 )T 

(z 1 ,z 2 )m(2 1 ,z 2 r 


into 

Q(m,mmsr , (iv 1 ,iv 2 rQ , = 


(32) 

(33) 




motion of the space. 

Let Pi and P 2 be two points of the hyperbolic space. Every motion evidently 
carries the cross-ratio matrix 

{ 巧，巧 }= ( 巧 , 3 (A) ， i^ ，3 (ft)) 

= (Pi- p 2 )(Pi - P2)(z{Pi)~ 3 ( ⑸ r 1 广 1 (34) 

into a similar matrix. (34) is called the distance-matrix between two points. We have 
the following theorems. 

Theorem 11 The hyperbolic apace is transitive. 

Theorem 12 The distance matrix has simple elementary divisors and posi¬ 
tive characteristic roots. Two point-pairs are equivalent if and only if their distance 
matrices have the same characteristic roots (multiplicities are counted). 

Theorem 13 The space is symmetric. 

By a symmetric space, we mean that to each point of the space there exists an 
involution with the point as its isolated fixed point and that every transformation of 
the space having the point as fixed point is commutative with the involution. 
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is real. This is Siegel’s generalization of the Poincare half-plane. 
Sometimes we take, by Theorem 7, 



The space is formed by those points for which I — ZZ is positive definite. The group 
of motions consists of the transformations 

W = {AZ + B)(BZ + Ay 1 . (36) 

This is a generalization of the unit circle in the complex plane. 

Theorem 11 is evident, since any point P = Q + iR with positive definite R may 
be carried into iR by the transformation W = Z — Q of (35) and since there exists a 
real matrix C such that CRC' = /, then W = C -1 ZC /-1 carries iR into il. 

To prove Theorem 12, we use the second representation. We may take one of the 
points to be zero. Then {Z,0} = (Z,2 -1 ,0,oo) = ZZ, which is positive definite. We 
have a unitary matrix U such that 

and 

UZU' = [Xu- - ,Xn). 

This gives Theorem 12. 

The space is transitive- In order to prove Theorem 13, we have to show that 

there exists one and only one involution which has 0 as its isolated fixed point and 

that the subgroup leaving 0 invariant is commutative with it. 

The transformation leaving 0 invariant carries also oo into itself. Then it takes 
the form 

W = AZA'. (37) 

(37) denotes an involution if and only if j 4 2 = The equation Z = AZA' has 
no other solution but zero if and only if the characteristic roots of the second power 
matrix of A are different from 1. Further the roots of A are ±1 and 士 i; the only 
possibility is that A = ±»7 by Theorem 44.4 of Mac-Duffee®. Now we have the 
unique involution 


① Theory of Matrices, 1933. 
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W ^ -Z. (38) 

It is evidently commutative with all transformations (37). 

Remark In order to prove the promised statement of the preceding section, 
it is sufficient to prove that the points 

X + Yi, X, Y real, 


are classified into transitive sets according to the ranks and signatures of Y. This is 
evident, since Z\ = Z — X carries the point into Yi and Z\ = AZA' carries into 
a diagonal matrix of the form i[l, … ， 1 ， _1 ，…，一 1 ， 0, ... ,0]. 

10. Elliptic geometries 

Now we take 3 to be an anti-involution of the second kind as an absolute. More 
definitely, we take 

3 

where 

H =[ l , ■■- ，1，_1， …， 1], 

where there are p l's and q -l'a, q>q. The hypercircle has the matrix 
H 0 \ 

OH). 

The space is formed by the points for which 

ZiHZ[ + Z 2 HZ' 2 = (Z u Z 2 m(Zi ，名 2 广 (39) 



is of the same signature as H. This geometry will be called the elliptic geometry 
of signature (p,q)- The group of motions is defined by the symplectic transformations 


(Z 1 ,Z 2 ) = Q(W 1 ,W i )% 


(40) 


with 

T3 = 3T, (41) 


since (40) carries (39) into 

< 3(^1 , {Wi, Wi)'Q! = QiW^Wi^WuWiY'Q'. 


In the case p = q, the transformations (40) with 33 = - -3- 
also leave the space invariant. Then the group of motions is now constituted by both 


kinds of transformations. 
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FYom (41), we have 


Theorem 14 The space is transitive. 

Proof For any point (v4, B) of the space making AHA' + BHB' of signature 
(p, q) we can find a matrix P such that 


P(AHA' + = H. 

V= ( PA PB ) 

\ -HPBH HPAH I 


is symplectic, in fact 


PA(PB)' = PAB'P' = (PB){PA)\ 
{-HPBH)(HPAH)' = (HPAH)(-HPBHY 

and 

PA{HPAH)' - PB(HPBHY = PiAHA' + BHB')P'H = /. 

The transformation (Zi, Z 2 ) = is a motion of the space, since 


and it carries (1, 0) into the point (A,B). 

The group of stability at the origin, that is, the subgroup of transformations 
leaving 0 invariant, consists of the transformations of the form 


where >1 is a coryunctive automorphism of H, that is, AHA' = H. We may prove 
also the following theorem. 


As in §9, we may introduce the distance 
by {Pi,P 2 } = (Pi,3(Pi),P 2,3(P2)) - Putting J 
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Here the problem arises to classify symmetric matrices Z under the group of conjunc¬ 
tive automorphisms of an Hermitian matrix. It would require too much space to solve 
this problem: hence at present we content ourselves with the special case H = I, 
which is the direct generalization of elliptic geometry. 

Theorem 16 The distance matrix has simple elementary divisors and posi¬ 
tive characteristic roots in the dliptic space of signature (n, 0). Two point-pairs are 
equivalent if and only if their distance matrices have the same characteristic roots 
(multiplicities are counted). 

Notice that in the elliptic space of signature (n,0) the group of motions is formed 
by all unitary symplectjc matrices. 

11. Geometries induced by involutions 

Now we take an involution 0 of the second kind as the absolute; without loss of 
generality we may take 

W = -Z. (44) 


(A 


(U 0 V 


0 ^ 


B ) 

U 

d) 

( 0 -a J 

"lo 

-iJ J 


D ) 


we deduce that B = C = 0. Thus the transformations of the space are given by 

Zj = AZA', (45) 

which form a group Gi . If the involution Z\ = —Z~ l is adjoined to Gi, a group G is 
obtained which contains Gi as a subgroup of index 2. The geometry of the symmet¬ 

ric matrices under the group Gi is equivalent to the algebraic problem “congruent 
classification of matrices” • Evidently the space is not transitive. The rank of 尸 is the 

characteristic invariant for a finite point. 

For a pair of finite points Pi and P 2 , we have (Pi,P 2 jO, 00 ) — PiP】、• Thus the 
elementary divisors of PiP^ 1 characterize the equivalence of a pair of finite points. 

Now we take an involution 3 of the first kind with signature (p,q) asan absolute, 
for example 

Z x = HZH. (46) 

Now the matrix % of the transformations satisfies 


( ^ 



0 V 


0 "l 

( A 


U 

d) 

[o 

h)- 

'U 


\c 
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Consequently we have 



Then 2 may be considered as a direct product of two symplectic matrices 



The group so formed is denoted by Gi- 
The equation 

f A B \ f H 0 \__(H 0 \ / A B \ 

V C 1> 八 0 H ) \ C D ) 

cannot hold, except when p = q, n = 2p. Then we have a group G which is generated 
by Gj and the additional transformation 



For simplicity’s sake, we take p = n — 1, q = 1, that is, the absolute is a 
fundamental involution. 

Theorem 17 The arithmetic distance of P and 3(P) characterizes the equiv¬ 
alence of points. More definitely, every point is either equivalent to 0 or equivalent 
to 


Proof Let 


Then 
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carries P into 



(47) 


If 17 = 0, there is nothing to be proved. Otherwise, we have a matrix M such that 


vM = (1,0,• • • ,0),M = Then the transformation 



carries (47) into the required form. 

The non-equivalence of the two points given in Theorem 17 is evident. 

It is easy to extend the theorem for arbitrary p and q. 

Remarks (1) The geometries so obtained are generalizations of the complex 
analogy of the Mobius geometry of circles. There is a particularly great variety of 
geometries over the real field. 

(2) We may take several commutative involutions as absolutes; for example, 
we take W = —Z, W = Z~ l as absolutes, then the transformations take the form 
W = TZT', where r is an orthogonal matrix. Thus “the orthogonal classification of 
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of points, we introduce a “simple ratio-matrix”，that is, 

(X 1 ,X 2 ,X 3 ,oo) = (X 1 - X 2 )(X 3 - X 2 )~\ 

Further, the transformations (48) with unitary A forma subgroup. The geometry 
under this group may be called the Euclidean geometry of symmetric matrices. The 
transformations (48) with A of determinant I form also a group. The corresponding 
geometry may be called the special Laguerre geometry. 

13. Fixed points of an involution of the second kind 
In §13, §14 we determine the fixed points of involutions. 

Theorem 19 An involution of the second kind has two isolated fixed points; 
they form a nonspecial pair, that is, a pair of points with arithmetic distance n. 
Conversely, a nonspecicd pair of points determines uniquely an involution of the second 
kind having the points as its isolated fixed points. 

Proof (1) It is sufficient to prove that 土 iJ are two isolated solutions of 

Z 2 = -I. (49) 


If Zq is a non-scalar solution of (49), then TZoV is also a solution for all orthog¬ 
onal r. Thus Zo is not an isolated fixed point of TV = —Z -1 . 

If Zo is scalar, then Zq = ±t/. From (t/ + A) 2 = —I we have △(△ + 2il) = 0. 
For sufficiently small A, A + 2i/ is nonsingular, consequently A = 0. This establishes 
that ±il are the two isolated fixed points. 

(2) Conversely, let 0 and oo be two fixed points. Then the involution takes the 

form 

W = AZA', A 2 = -I. 

Using this in connection with (37), we have W = —Z. 

Theorem 20 Let Pi and Pi be two isolated fixed points of an involution of 
the second kind; all its fixed points X are given by 


r{P u X) + r(X,P 2 )=n 


(50) 


and conversely ①. 

Proof (1) All the solutions of (49) are given by 


Z = T 


-J ⑼ i 
0 


0 


r 7 , 


① r(A, B) denotes the rank of A — B; it is called the arithmetic distance between A and B. 


r(z ， H)=r ((-r* : 

and r(—Ii,Z) = q. 

(2) Now we let 

r(Z, il) = p and r(Z, —il) = q. 
There exists a matrix Q such that 

qzq , = ^2'4 Ck) > Q J Q' = 5Z ' J(Ck) ' 



Direct verification shows that Cfc = 1 and that d(Z — XI) = 0 has i as root of 


such that 

Z = Q 

Theorem 21 Let Pi and P 2 be two isolated fixed points of an involutic 
the second kind. Let X be a point satisfying r{Pi,X) = r{P 2 ,X) = n, and let X 
the image of X under the involution. Then Pi,P 2 , X, Xq form a harmonic rangi 


'I^H 0 
0 -/<*»)» , 
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where Z\ and are called components. 

Theorem 22 Let p + q = n. The fixed points of an involution of signature 
(p, q) form a decomposable subspace of type {p,q). 

Proof We write 


HZH _ Z = 


jlp) 

o 、 

0 

-IM ) 


Zl2 \ 

^12 

蝴 J. 

0 

_2Zi2 

-2Z' l2 0 


= 0 , 


which implies Zyi = 0. Thus we have the theorem. 

Theorems 22 and 2 give us a general definition of decomposable subspace: 

Let ^ be a skew symmetric symplectic matrix. The points satisfying 


{Z X ,Z 2 )^Z U Z 2 )' = Q (52) 

define a decomposable space. It is of the type (p ， q), if d ( 只一入 ？）= 0 has 1 and — 1 
as roots of multiplicity 2p and 2q respectively. 

The justification is almost evident, since on multiplying = Q(Zi ， Z 2 )T 

on the right by 5(Z], Z 2 )’，we obtain 

= Q{Z 1 ,Z 2 )R{Z l ,Z2)' 

(notice that (Z\, Z 2 ) and (Wi, W 2 ) denote the same point, if and only if (Zi, Z 2 )5 (H^i ， 

w 2 y = 0). 

Further we may prove that the involution is uniquely determined by the decom¬ 
posable subspace. 

In the same way we may define the manifold of the fixed points of an involution 
of the second kind. 

15. Fixed points of an anti-involution 

Now we are going to find the fixed elements of anti-involutions. 

Theorem 23 The fixed points of an anti-involution of the first kind form a 
connected piece of dimension n(n + 1)/2 (real parameters). 




Now we are going to determine the dimensions of the manifolds formed by invo¬ 
lutions and anti-involutions. 

Theorem 25 The number of parameters {complex) of involutions of the second 
kind is equal to n(n + 1) and that of involutions of the first kind of signature (p, q) is 
equal to 4pq. 

Notice that Apq 彡 n 2 < n(n + 1). 

Proof (1) By Theorem 19, we have the first part of the theorem. 

(2) Now we consider involutions of the signature (p,q). It' is known that the 
group of 2n-rowed symplectic matrices depends on n(2n +1) parameters. We are 
going to find the number of parameters of the subgroup leaving the decomposable 
subspace 

(X^ 0 ) 

I 0 x^) 
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invariant. By the result of §11 and Theorem 22, the number of parameters of the 
subgroup is equal to p(2p + 1) + q(2q + 1). Therefore the number of parameters of 
involutions of the signature [p,q) is equal to 

n(2n + 1) - p(2p + 1) — q(2q + 1) = 2(n 2 - p 2 - q 2 ) = 4pq, 
since n = p + g. 

Now we give also the number of parameters of anti-involutions. 

Theorem 26 Anti-involutiona of the first or the second kind each form an 
n(2n + 1) parametric family (real parameters). 

Proof We consider the anti-involution T of the first kind. Put 

T=(3-6ff)(0 + 65)- 1 . 

FYom T*X = 3 and = 5 we deduce easily that 

6 + 6 * = 0 , 6 = 6 ’. 

That is, © is pure imaginary and symmetric. The theorem follows. 

In the case of anti-involutions of the second kind, we have correspondingly 

636* = 5, e = S , . 

Putting 0 = (3 - <P)Q + W _1 again, we have = 0,«P = ^, that is, 



where S is symmetric andis Hermitian. The number of parameters is n(n+l)+n 2 = 
n(2n + 1). 

17. Dieder manifolds 

In this section we study the dimension of dieder manifold. 

Definition Let A and B form a nonspecial pair. The locus of the points X 
satisfying 

r(AX) + r(X,B) = n (55) 

is called a dieder manifold of the point-pair {A,B). 

As a consequence of Theorem 20, the fixed points of an involution of the second 
kind form a dieder manifold of its two isolated fixed points. It is called the dieder 
manifold of the involution. 




Theorem 27 Let © be an involution commutative with X, an involution of 
the second kind. Then 6 carries the dieder manifold, of % into itself. 

Proof © carries the isolated fixed points (_A ， B) of X to (C, D). Then (C, D) 
are isolated fixed points of ©I© 一 1 . Since 6X6 -1 = 土 X，© either keeps A and B 
fixed or permutes A and B. In both cases the dieder manifold is invariant. 
Definition The part of the dieder manifold satisfying 

r(i4,X) =p (56) 

is called the component of index p with respect to A. 

Theorem 28 The component of index p is of dimension 2pq. 

Proof It is now more convenient to use the homogeneous coordinate-system. 
We may assume that A and B are (0,1) and (/, 0) respectively. The component of 
index p is constituted by the points 

{Z U Z2), ( 57 ) 


where Z\ is of rank p and Z 2 of rank q. Evidently 



establish the following two facts: 

(1) We have no Q, different from the identity, such that 


，««.))=(( 


(58) 


(59) 


(2) In general, each point on the component is of the form (58). 

To prove (1), we let 

9u 912 
921 922 

Comparing the elements on both sides of (59), we have qu = 1^, qi 2 = 0, 921 = 
0, 922 = - Thus (1) is true. 

(2) The transformation leaving (0, 1) and (/, 0) invariant is of the form 



W x = QZiA, W 2 = QZ 2 D, AD' = I. 
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Thus the points on the components axe given by 


We take the general case d(on) ^ 0, d{dn) ^ 0. Thus 

( on 0 ) an a 12 ) ( 0 0 

V 0 °2'2 八 V 。 0 广 V 吻幻 2 厂 

gives the required form. The conditions of inequality are irrelevant in counting the 
parameters. 

Consequently, two components of indices r and s of a dieder manifold are not 
equivalent topologically, if r 〆 s and r + s / n. As a consequence of Theorem 27, we 
have the following theorem. 

Theorem 29 A dieder manifold is of dimension 2[n/2](n - [n/2]) where [x] 
denotes the integral part of x. 

18. Commutative involutions of the second kind 

Without loss of generality the problem to be solved can be stated as follows: 
Given an involution of the second kind, say (12), find all involutions of the second 
kind commutative with it. 

Let T be the matrix of the required involution, then T 2 = —% and 


( u 

0 ) 

=±[ u 

0 ) 

\ 0 


V 0 



(1) Taking the upper sign, we have 
B ' 
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Now we are going to find its isolated fixed points. There exists a nonsingular 
matrix T such that rAT - 1 二 iH, where 丑 =[1, …， 1, 一 1, • - • , —1], where there are 
p l’s and q —l’s. Let 

w 0 = rivr , , z 0 = tzv, 


we have 

W 0 = -HZ 0 H (62) 

and notice that (12) takes the form Wo = —Zq. We drop the subscript 0. 

Using homogeneous coordinates, we have 

{Zx,Z 2 ) = Q(W, ， W 2 )( l : 丄). （63) 

The transformation 



(I p 

V 4 

u ;) 

, p = 

{ I ⑹ 
{ 0 

»)• 

u = 1 

( 0 0 
^ 0 J ⑷ 

) 

(64) 

:8 (63) into 

4 iH 

0 、 

|x _1 = 

:( IP 


f iH 

0 ) 


-iJ, \ 


\ 0 

- iH , 


I p ) \ 

K 0 

-iH ) 

V 

IV ) 



il 0 彳 
0 -il ) 



The last matrix represents (12), and it has 0 and oo as its isolated fixed points. Thus 
the isolated fixed points of (63) are (P, il g ), {H q ,I p ) which lie on the dieder manifold 
of (12). 

(2) Taking the lower sign of (60), we have A = D = 0. Then 


I 


0 

-B'- 1 


B 

0 


Since 



B is symmetric (=5, say). Then we have 


w = sz~ l s. 


(65) 
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It has ±5 as its two isolated fixed points. Since 0, oo,S, —S form a harmonic rar 
we have the following theorem. 

Theorem 30 If an involution of the second kind is commutative with a fixed 
involution of the same kind, its isolated fixed points either lie on the dieder manifold 
of the fixed involution or separate the isolated fixed points of the fixed involution 
harmonically. 

The proof of Theorem 29 suggests also the following theorem. 

Theorem 31 The involution of the second kind commutative with a fixed 
involution o/ the second kind depends on n(n+ 1)/2 parameters (complex). 

Proof The case (2) gives us n(» + 1)/2 parameters, as 5 is symmetric. The 
case (1) gives 2[»/2](n- [n/2]) (< n(n + 1)/2) parameters as shown by the following: 
Lemma The solution of the matrix equation 

A 2 = I (66) 

depends on 2[n/2](n - [n/2]) parameters. 

In fact, let A have 1 as characteristic root of multiplicity p, and -1 of multiplicity 
q. The most general solution is given by 

where T is nonsingular. 

Further, if - ,1,-1, •• - , — l]r = [!,••• ,1,-1,-•• . - 1]> then 



which depends on p 2 + g 2 variables- Thus for a fixed P, A depends on 
n 2 — p 2 - g 2 = 2pq 

parameters. This expression has its maximum for p = [n/2]. 

Since 

2[n/2](n- [n/2]) < 2(n/2) 2 <n(n + 1)/2, 

we have Theorem 30. 
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19. Number of parameters of involutions commutative with a given invo¬ 
lution 

Now we push Theorem 31 a little further. 

Theorem 32 The involutions commutative with a fixed involution of the sec¬ 
ond kind depend on n(n + 1)/2 parameters. 

Proof We need only to consider the involutions of the first kind commutative 
W — —Z. Then, we have either 


Zi = AZA!, A 2 = I, 


(67) 


or, for even n, 


Z, = KZ~ l K, 


( 68 ) 


where K is skew symmetric. (67) depends on [n/2](n - [n/2]) parameters by the 
lemma of the last section. (68) depends on n(n - 1)/2 parameters. Both do not 
exceed n(n + 1)/2. 

Theorem 33 The involutions commutative with a fixed involution of the first 
kind of signature (p,q) depend on p(p + 1) + 9(9+1) parameters. 

Proof We take Z\ — HZH, = [1, •• - ,1, -1，... ，- 1] as the fixed involution. 
Let 

1 

be an involution commutative with it, then we have 




(1) Taking the upper sign, we have 

-(f ：)■ -(T ：)■ -(f ：)■ -(t 1 ：)■ 


where 



are involutions of the same kind. By Theorem 25, the number of parameters is 


p(p+l) + g(? + l). 
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(2) We take the lower sign in (69). This is possible only when p = q. Then 



We have T 1 T 2 = and T\ is symplectic. Thus the number of parameters of 2\ is 
equal to p(2p+ 1) and 7^ is determined uniquely. Since 


p(2p + 1) < 2p(p + 1) 

we have the theorem. 

Notice that 

p{p + l) + q(q + l) = p 2 + <^ +p + q 

^ (p + ?) 2 /2 + P + 9>n(n + l)/2. (70) 

20. Farther study of commutative involutions 
First of all, let us classify commutative involutions. 

Definition Two commutative involutions of the second kind, Ti and T 2 , which 
have isolated fixed points forming a harmonic range, are said to be commutative 
nondegenerately. Otherwise, we say they are commutative degenerately. For the 
degenerate case, the arithmetic distance between an isolated fixed point of and 
the other of X 2 is either p or q. We assume that 0 < p < n/2. Then p is called the 
arithmetic distance of !Ti and 玄 2 . 

Analytically, Ti and T 2 are commutative nondegenerately if and only if 

TiT 2 = -T 2 T, (71) 

(see (2) of §18). In the present section we are going to establish the following theorem. 

Theorem 34 Let n = 2 a r, where t is odd. There are <7 + 3 involutions of 
the second kind any two of which are commutative nondegenerately, and <r + 3 is the 
maximal number. 

In other words, there are a+3 nonspedal pairs of points such that any two pairs 
form a harmonic range. 

We prove the present theorem together with the following: 




Theorem 35 Let n = 2 CT r, where r is odd. Let P(n),S(n) and K{n) denote 
the greatest integer s for which there exist s n-rowed symplectic, symmetric and skew- 
symmetric matrices … ,T a such that 

T? = -I, TiTj = -TjTi, (72) 

respectively. Then 

P{n) = <t + 2 {<t ^ 1), (73) 

i7(n) = a+l (<7^0) (74) 

and 

K(n) = (<r ^ 1). (75) 

Notice that (73) implies Theorem 35 and that P(n) and K(n) are defined only 
for even n. 

We establish (73), (74) and (75) as consequences of the equalities 

P(2n) = 2 + i7(n), (76) 

S(2n) = 2 + K{n) (77) 

and 

K(2n) = 1 + K(n) (78) 

and the three initial equalities 

K(2t) = 1, (79) 

r(T) = 1 (80) 

and 

S(2t) = 2 (81) 

for odd r. 

We are going to prove (76) first. We start with the involution (12). By (2) of §18, 
the involutions commutative with (12) are of the form (65), where 5 is a symmetric 
matrix. There is a matrix T such that r ; r = 5 -1 . We put r'WT and T'ZT instead 
of W and Z, then (12) takes its original form and (65) becomes 

W = Z~ l . (82) 


Hence any pair of nondegenerately commutative involutions may be carried simulta¬ 
neously to (12) and (82). 
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Now we consider an involution of the second kind commutative with both (12) 
and (82). Since it commutes with (12), it is of the form (65); since it commutes with 
(82), we have S 2 = —I. Let 

W = SiZ^Si, 1<*< P(2n) - 2, 

be involutions commutative nondegenerately. Then we have = —I, = 

that is, 

SiSj = -SjSi, 5? = -I. (83) 

Therefore, we have 

P(2n)-2^ r(n). 


On the other hand, if we have i7(n) symmetric matrices satisfying (83), we may find 
2 + T(n) symplectic matrices with the required property (in the following we shall 
not repeat this statement at similar occasions). Therefore we have (76). 

From (83), it follows that Si has simple elementary divisors and ±i are its char¬ 
acteristic roots. We have an orthogonal matrix T such that 



If there exists an S 2 such that 




instead of Sk, we find that (84) remains unaltered and S 2 takes the form 


If (85) is commutative with (87) nondegenerately, for / ^ 3, then the Ai(l > 3) 
are skew symmetric and satisfy Af = —I^ a \AiAk = —AkAi. In case o is odd, there 
is no such A, we have therefore (81). Otherwise, we have (77). 

Now we start again with (2n)-rowed skew symmetric matrices 


^ = -3- 


From we find 


K lk K 2k 
v K^k —Kik 


where Kik and K 2 k are skew symmetric matrices. 
We introduce the transformation 




Then = 5 and 


Lie = —Kik + iKik, 

Affc = + iK\k- (【 

4(fc > 2) can exist only when n is even. Thus we have (79). From (88), 


LiMi = / 


(92) 




Remark It we restrict the J s to oe ortnogonai m tne tneorem, tne maximum 
s is equal to K(n) = a. In fact, from 

T 2 = -/, TT = I, 
we deduce immediately T = —T 1 . 

21. Parameters of involutions commutative with a pair of involutions 

Now we establish the characteristic distinction of all sorts of commutative invo¬ 
lutions of the second kind. 

Theorem 36 Given two nondegenerately commutative involutions of the sec¬ 
ond kind, the involutions of the second kind commutative with both depend on [n/2] 
(n — [n/2]) parameters (complex). 

Proof (1) Now we take, without loss of generality, the two fixed involutions 
as (12) and (11). 
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The involutions, nondegenerately commutative with both given involutions, are 
given by 

W = SZ^S, 5 2 = (98) 

We have an orthogonal matrix T such that rST' = Hi, H = [1, ■ • • , 1, —1, …， —1]. 
The totality of orthogonal matrices T depends on n(n — 1)/2 parameters. Further 
from THr 1 — H, we deduce 

k 0 t ⑷人 

which depends on p(p - 1)/2+ q(q- 1)/2 parameters. Thus S depends on 
(n(n - 1) - p(p - 1) - q(q - 1))/2 = pg< [n/2](n - [n/2]) 

parameters. 

(2) Consider the involutions commutative degenerately with one of the fixed 
involutions, say W = -Z. Then they are of the form (61). If they commute with 
W = —Z~ l , either degenerately or not, then 

W = AZ~ l A' = (AZAT 1 = A'~ l ZA~ l 
and we deduce A' A = ±1. Then 

A'= -A'A 2 = TA, 

that is, i4 is either symmetric or skew-symmetric. If it is symmetric, the number of 
symmetric involutions A has been counted in (1). 

Now we assume that A is skew-symmetric and orthogonal. Consequently n is 
even, equal to 2p, say. Since A 2 = A has 士 i as its characteristic roots and its 
elementary divisors are simple. Since 

d(A - XI) = d(A - XIY = d(-A- A/), 

the multiplicities of i and —i are equal. 

The pair of matrices 



has the prescribed elementary divisors. Thus we have an orthogonal matrix T such 
that 
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The totality of orthogonal matrices depends on n[n — 1)/2 parameters and symplec- 
tic orthogonal matrices of order 2p depend on p 2 parameters. Thus the number of 
parameters is 

n(n-l)/2 - n 2 /4<((n- l)/2)(n-((n- 1)/2)) < [n/2](n - [n/2]) 
and the theorem follows. 

Theorem 37 The involutions of the second kind commutative with two com¬ 
mutative involutions of the second kind of arithmetic distance p depend on p(p+l)/2 + 
q(q + 1)/2 parameters. 

Proof We may write both fixed involutions as (12) and 

W = -HZH, (99) 


where = [1， …，1， — 1， •••, —1]. 

( 1 ) The involutions commutative degenerately with (12) are given by (61). If 
they commute with (99), we have AH = ±HA. Consequently, we have 



the second case can appear only when p = n/2. 

① The number of parameters of 

卜 0 ), a 卜 -Z ⑼， ai = -/(*> 

\ 0 a 2 J 

is equal to 2[p/2](p- [p/2]) + 2[q/2](q - [q/2]), by the lemma of §18. 

② FVom A 2 = —1, we deduce 


(: 



b l b 2 = -/ (p) . 


Thus the number of parameters is equal to p 2 , which is less than p(p + 1). 

(2) The involutions commuting with (12) nondegenerately axe of the form 


W = SZ~ l S. 

If they commute with (99), we have HS = ±SH. Consequently, we obtain 




2[p/2](p- b/2]) + 2[g/2](g- [q/2]) < 


22. Automorphisms of the group of symplectic transformations 

Finally, we are going to prove the following theorem. 

Theorem 38 A topological automorphism of the group of symplectic tmnsfor- 
mations is either an inner automorphism or an anti-symplectic transformation. 

More explicitly: Let ® be the group of symplectic transformations. It may be 
visualized as the totality of symplectic matrices X, but we have to identify T and 
-T. A is called an automorphism over 0, written as 

■4(1) =#， (100) 

if it builds up a one-to-one and bi-continuous relation between X and and if 

A^2) = ( 101 ) 

The conclusion of the theorem is that we have either a symplectic matrix 屯 such 

that 

= ( 102 ) 

or a symplectic matrix 屯 such that 

A{1) = ( 奸无 ) 郎 - 1 = (103) 

for all 1. 

Proof Evidently the automorphism A carries an involution into an involution 
and a pair of commutative involutions into a pair of commutative involutions. The 
involutions commutative with an involution of the first kind of signature (p, q) form 
a manifold of dimension p(p + 1) + q(q + 1), by Theorem 33, and those commutative 


with an involution of the second kind form a manifold of dimension n{n + 1)/2 by 
Theorem 32. Since the numbers 

n(n+ 1)/2, p(p + 1) + 9(? + 1)，1 < g < n/2 

are never equal, and since a topological transformation leaves the dimension invariant, 

nvaria 
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Further, the manifold formed by the involutions of the second kind permuting 
with a pair of commutative involutions of arithmetic distance p is of dimension 

p(p +1)/2+ g(g+1)/2 

by Theorem 37. They axe all different for 0 < ^ n/2. Moreover the manifold 

formed by the involutions of the second kind permuting with a pair of nondegenerate 
commutative involutions is of dimension [n/2](n — [n/2]), by Theorem 36. Therefore 
the “degeneracy” of a pair of commutative involutions is also invariant under A. 

Each involution of the second kind is determined uniquely its two isolated fixed 
points. Thus, we use (A, B) to denote an involution of the second kind possessing A 
and B as its two isolated fixed points. 

We now prove that two involutions with a common isolated fixed point, and the 
other two fixed points forming a nonspecial pair, are carried by A into involutions 
with the same property. In fact, let 

(A,B), (A,C) (104) 

be the two involutions under consideration, and suppose that A carries them into 

(P,Q), (R,S). (105) 


The transformations commutative degenerately with both involutions (104) form 
a group isomorphic to the orthogonal group. In fact we may take A = 0,B = oo and 
C = I\ the conclusion follows immediately. 

Suppose that at least one of the arithmetic distances r(P,R), r(P, S), r(Q, R) and 
r(Q,S) is different from 0 and n. Then the subgroup formed by the transformations 
commutative degenerately with (P, Q) and (R, S)is reducible. In fact, we may let, in 
homogeneous coordinates, 



Direct verification asserts that the group is formed by the elements of the form: 



^ 22-^22 = 
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Since the orthogonal group cannot be isomorphic to a reducible group, we have 
the conclusion that either our assertion is true or that 

r{P,R) = r{P,S) = r{Q,R) = r(Q,S) = n. 

The latter case cannot happen. In fact, there is no pair of matrices (X, V) to 
separate both (A, B) and (A,C) harmonically, and this may be justified easily by 
putting A = 0, B = oo and C = I. On the other hand, there exists a nonspecial 
pair {X,Y) to separate (P,Q) and (R,S) both harmonically. In fact, we may take 
P = 0,Q = <x>,R = I-, then the pair (Si, —Si) separates both pairs harmonically 
when Si = S. Thus we have established our assertion. 

Let 17(A) be the set of involutions having a common fixed point A. Then, by A, it 
is mapped into another set with the same property. In fact, if (A, B), (A, C), (r(B, C) 
=n) are mapped into a third pair (A, D) must be mapped into 

either {A^D^ or The latter case cannot happen, since 乂 cannot map a 

fourth one. We may remove the assumption r(B, C) = n by means of a consideration 



Suppose S{A) is carried into by A. Therefore A induces a mapping of 

the space of symmetric matrices ： (A —* A^). 

Let r(i4, B) = p. The points C satisfying 

r(A,C) = n, r(S, C) = q 

form a manifold of dimension nq — q(q + 1)/2. Let D be the image of B under the 
involution (^4, C). Then (A,C),{B,D) are two commutative involutions with the 
arithmetic distance q. Suppose A carries ^(A) and E{B) into S(A^) and 27( 丑十 ）. 
Then we find that r(A^,B^) = p or q. Since np — p(p + 1)/2 於 nq - q(q + 1)/2 for 
p^q,y/e have r(A^, B^) = p. 

Therefore the automorphism A induces a mapping on the space of symmetric 
matrices possessing the following properties: 

(1) It is topological; 

(2) It keeps harmonic separation invariant; 

(3) It keeps arithmetic distance invariant. 

The theorem follows from the generalization of von Staudt's theorem for sym¬ 
metric matrices. 




Geometries of matrices. III. Fundamental theorems 
in the geometries of symmetric matrices* 

1. Introduction 

The author gave, in the paper Ii ①， a discussion of the redundancy of the con¬ 
ditions which appeared in the generalization of von Staudt’s theorem for symmetric 
matrices of order 2. The author then could also give a proof for matrices of ail even 
orders but was unable to prove the result in its full generality, and further the proof 
depended essentially on some results about involutions. The author has now found 
a simpler proof which holds for any order greater than 2 and which is independent 
of the results about involutions, Nevertheless, the proof is carried out by means of 
mathematical induction upon the result of Ii, which depends essentially on the theory 
o£ involutions. 

Several theorems for geometries keeping an involution £is an absolute have also 
been obtained. 

The second part of the paper is concerned with analytic mappings. The pro¬ 
jective space of symmetric matrices may also be considered as the extended space of 
several complex variables as defined by Osgood®. So far as the author is aware, the 
completeness of the group of automorphic mappings of an extended space has been 
established only for two special cases, namely the space of function theory and the 
complex projective space. If we assume that the group is topological, we may deduce 
the result from a theorem of E. Catan® for semi-simple groups. In this paper, the 
problem is solved without any restriction besides analyticity. 

As an application of the previous result in combination with the continuity theo- 

* Presented to the Society, April 27, 1946; received by the editors May 30, 1945. Reprinted 
from Transactions of the American Mathematical Society, 1947, 61 ： 229-255. 

① Papers I, Ii, and II under the same title were published in TYans. Amer. Math. Soc., 1945, 
57: 441-481, 482-490; 1947, 61: 193-228. They will be referred to in the present paper simply by 
their numbers. 

② Lehrbuch der Funktionentheorie, 1929, 2. 

(f) La theorie des groupes finis et contmus el I'analysis situs. Memorial des sciences math4 - 






:em due to Levi 1 ^ and with a result due to the author'® 1 , we solve also the correspond- 
ng problem for the group of automorphs of the elliptic space. It should be remarked 
: hat the corresponding problem for the hyperbolic space was solved by C. L. Siegel ③ 
n a recent important paper. 

2. Arithmetic distance 

Let 6 and T be two sets of points in the ll projective" space of symmetric matrices, 
rhe arithmetic distance between S and X is defined to be the least upper bound of 
the arithmetic distance between any two points S and T, where S and T belong to 
S and 1 respectively. It is denoted by r(6,T). 

In particular for & = %, the distance r(S,S) is called the arithmetic diameter 
if the set 6. 













Froo 

consist 











v' 

v a 

is a nonsingular (p + l)-rowed matrix. The theorem is now evident. 

Is the converse of Theorem 1 true? It is true for p = 1, but in general we have 
the following “gegenbeispiel” ： 
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for all a, b, c implies that 



is of rank 1. This fact shows that S is a maximal set. On the other hand, we have 
r(6, 0) = 1, which cannot hold in the normal subspace. 

3. Arithmetic property of the normal subspace 

Now we extend the concept of dieder manifold a little further: 

Definition Let Pi and P 2 be two points. The points X satisfying 

r(P u X) + r(X,P 2 ) = r(P 1 ,P 2 ) 

are said to form a dieder manifold spanned by Pi and Pi. 

Theorem 2 A maximal set of rank p is a normal space if and only if it contains 
all dieder manifolds spanned by any two points of the set. 

Proof (1) Let 

P = 0, Q = [l, ••- ,0] 

(where the l’s are p in number); we shall prove that, if 

r(P,X)+r(X,Q)=r(P,g), 

then X takes the form 

(T ：) - 

We consider the pair of matrices Q and X. There exists a nonsingular matrix r ① 
such that 

r -i xr /-i = [1) ... ,1,0,... ,0], 

(where the l’s are p in number), and 

r -lgp/-l = q 

From the second equation, we find that 



① See 1\jrnbull and Aitken. Theory of canonical matrices, 1931 ： 135. 



Then 


x = r 


J(P) o 


V 0 0 / 

Consequently, we have the assertion. 

Therefore a normal subspace contains the dieder manifold spanned by any pair 
of points of the subspace. 

(2) Without loss of generality, we may assume that the maximal set contains two 
points 

尸= 0， <? = [1，... ，1，0,…，0】 

(where the l’s are p in number). So it contains 

Pi = [1, ••- ,1,0,1, •• ,1,0, ••- ,0], 1 < t < p 

(where the first zero in the brackets appears in the ith place, the second in the 
(p - l)th), and 

Pij = PiPj, Pijk = PiPjPk-, 

and so on. 

Let 

X = (Ors)l<r,a<n 

be a point of the set. Then 

r{X,Pi) ^ p, r(X,Pij) < p, r{X, P ijk ) ^ p, 


and so on. Consequently, the matrix 



Oil — fil 

012 •• 

°1P 

°lp+l 

Ol2 

022 - £2 ■- 

- 02p 

02p+l 

Olfi 

Clip 

_ ■ a PP - £ P 

°PP+1 

Olp+1 

0-2 p+1 

■ a P p+i 

<V+lp+l 

ain 

02n 

■ O-pn 

a P+ln 



is always of rank p for ei, ••- ， e p arbitrarily taken from 0 and 1. In particular, 


flu — £l Ol2 . • ■ Alp Oli 

ai2 022 — ^2 • • • a 2p «2< 

flip ci 2 P … o,p P — e p Up, 

au a2i … dpi O-ii 
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for /? < i < n and any choice of e. Putting ei = 0 and £i = 1 and subtracting the 
results, we have 

0-21 — £2 … O-lp O-ii 

: : = 0 . 
oap … Opp — e p Opi 
0-2i … flpi o-ii 

Repeating the same process, we have finally 

a pp ~ e p a pi 

Opt Ait 

for e p = 0 and 1. Consequently Ojj = 0 and = 0. Varying p, we find that aji = 0 
for aJl 1 < j < p. Further varying i from p + 1 to n, we have the assertion. 

Therefore the elements of the set are of the form 

/ 0 \ 

k 0 0 J. 

By Theorem 1 and (1) we have the second part of our theorem. 

Since pairs of points with arithmetic distance p (1 ^ p ^ n) form a transitive 
set, the proof of the previous theorem establishes also the following: 

Theorem 3 Given two points with arithmetic distance p, there is one and only 
one normal subspace of rank p which contains both. 

4. Proof of the fundamental theorem 



Theorem 4 A continuous mapping carrying symmetric matrices into symmet¬ 
ric matrices and leaving arithmetic distance invariant is either a symplectic transfor¬ 
mation or an anti-symplectic transformation. 

Proof (1) The theorem was proved for matrices of order 2 in Ii. Now we shall 
establish the general theorem by induction on the order of matrices. We thus suppose 
that the order of matrices is not less than 3. 

⑵ Let 


r{Z) = Zi 


⑴ 


be a mapping satisfying our conditions. The points of the form 


k 0 0 J 
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form a normal subspace of rank n — 1. Since the arithmetic distance is invariant, the 
set of points 


forms also a normal subspace of rank n— 1. Since the totality of all normal subspaces 
of rank n — 1 form a transitive set under the group of symplectic transformations, we 
may therefore assume that T satisfies 


It induces a continuous mapping on the (n- l)-rowed matrices W and it keeps arith¬ 
metic distance invariant. Therefore by the hypothesis of induction we find either a 
symplectic mapping 

W = (aWi + 0){^rWi +S)~\ a = a (n - x >, (3) 


and so on, or an anti-symplectic mapping 


W = {aWi + 0)(^ + S)~ l . 

The symplectic transformation Z = {AZ\ + B){CZi + D)~ l with 


carries the transformation T inducing (3) into a new one satisfying 


(0 \ 


Applying the same method to those V inducing (4), we conclude, in both cases, that 
we may assume, without loss of generality, that T satisfies (5). 

(3) Since 

r ( o( n -” o ) 

is a matrix of rank 1, we may let 


• • • ,On). 
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Since the arithmetic distance between 
(/("-!) 


,_1 > 0 ) ( 0 ( n_1 ) 0 ' 
0 oj’ L 0 b 


is equal to n, we have a n ^ 0. The transformation Z\ — A'ZA with 
/ /("- 1 ) 0 \ 

A= I . . , v = {a u ■■- ,a n _i) 

\ -^n 1 a- 1 ) 

carries (5) into itself and (ai,- • - , a n )’(ai, … ,a n ) into 

(°r :)■ 


Therefore, we may assume further that 

J 0("-” 


: H 


0 

0 1 , 


⑹ 


(4) The transformation T leaves 


/ 0 C > 

0 

o \ 

( 0 

0 0 、 

0 


0 ， 

0 

0("-2) 0 

l 0 

0 

oj 

1。 

0 l ) 


invariant, therefore, by Theorem 2, we find 


0 0 
0 w( n - 




0 0 

0 w[ n ~ 


By the supposition of induction, we have either 

W (n - l) = {aWi + 0){r(Wi + <5) _1 
or 

W (n ~ l) = (aWi + 0){riWi + 6)- 1 , 


⑺ 

⑻ 


a 0 
k T 5 / 


is a 2(n — l)-rowed symplectic matrix. 
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According to (5)，the transformation (7) and (8) leaves every point of the form 
W 


f X (n_2) 0 、 

0 0 


invariant; (8) cannot hold. 

Since (7) leaves every point of the form (9) invariant, we have 

(ai ' 

a = I 

\ 0 . 


)0 = 0 , 


' 0 4 

、 C 3 c 4 , 


f ±/( n - 2 > 0 

、干 aaaf 1 aj 1 


Further, (7) carries 


0 0 
‘ 0 1<” 


into itself; we have = 0. 
Let 


'±1 0 
l 0 c^- 1 ) 


B = 


' 0 0 
o ^(n-l) 


C= (o 

The transformation 

Z = (AZi + B){CZx + D)~ 
carries T into a new one satisfying (5) and 


±1 0 
0 5 (n_1) 


' 0 0 
, 0 Wh- 1 ) 


H 


0 0 
0 


since the transformation leaves every point 

( At— 1 ) 


0 


invariant. 

(5) Since 


⑼ 


( 10 ) 


( 1 0 \ _ 

-f 1 0 ^ 

r (o {n - l) «V 


^ 0 O^- 1 ) ) 

V 0 O^- 1 ) ) 


\o 1 ) 



- 130- 


华罗庚文集代数卷 II 


we have 



and by hypothesis of induction, we have 

： i )= H ^:: 之)+仆(之之 H ' 

which is a symplectic transformation (notice that we omit the “anti-symplectic” case 
by the same reason given in (3)). 

Since it leaves every point 

invariant, we have either 

a = J ( 2 )，0 = 0 , ， =(◦:) ， 6 = 



For the first case the transformation with 



and for the second case the transformation with 


A^ll,l, ■■- , 1 ,- 1 ], B = 0, 




D = A, 
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carry T into a new one which satisfies (5)，(10) and 



r([Ai, ••- ,An_i,0]) = [Aj,,A n -l|0], 

r([0, 入 2, …， An]) = [0, 入 2 ,… ，入 n] 

and 

r([Ai ， 0, … ,0, A„]) = [Ai, 0, ■•- ,0, A n ]. 
Now we are going to prove that 


r([Ai, • • • , A n ]) = [Aj, ■ • • , A„]. 

Since 

»*(|Ai，... ,A„_i,0]) = 1, 

we have 


r([Ai, •• - ,A„]) = [Ai, ••- ,A„_i,0] + (ai, ••- ,On)’(ai, … 


( 11 ) 


( 12 ) 


Since 

K[^l， …， ^nl» [0, 入2,…， AnJ) = 1， 
we have 02 = 03 = • • • = On-x = 0 . Further, since 
r([Ai ， A2, …)\»])[^ 1 , 0 ,•• - , 0 ,A„]) 

=”([ 入 1 ， … ,An-1,0] + (ai ， 0, … • • - ,0,On), [Ai,0, ••- ,0,A n ]), 
we obtain that the matrix 

a? aiOn 

aiOn a 三 —A n 

is of rank zero, that is ai = 0 and a* = A„. Thus we have the assertion. 

( 7 ) Let 

T{Z) = Z x , Z = (Zij), Zi = (z<j). 
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Since (12) holds, the equation 

d{Z-[X!, ■■- ,A n ]) = 0 

(consider Z as fixed and vary Ai, - ■ • , A n ) implies 

d(n" ,A„])=0 

and vice versa. This implies 

d(Z — [Ai,,An]) = d{Z\ — [Ai, ••- ,A„J) 
identically in Ai, •. • ， An. Consequently, we have 

2 » = z <i. - Aj = - 

and 

= ±z ij- 

The transformation 

[1 ， -1 ， 1 ， .-. ， 11Z[1 ， -1 ， 1 ， .. ， 1] 

carries Zu into itself and z\i into —z\i. Thus we may assume that 
=z' u . 

By the same consideration given in the proof of Theorem 8 of I, we have 

r(z) = z. 

The theorem therefore follows. 

5. Affine geometry of symmetric matrices 

Theorem 5 A continuous mapping carrying finite points into finite points, 
infinite points into infinite points, and keeping arithmetic distance invariant is either 
an affine mapping 

W = AZA' + S (13) 

or an anti-affine mapping 

W^AZA'-¥S. (130 

This is an immediate consequence of Theorem 4. 



6. Mobius geometry of symmetric matrices 


As in II, we let 3 be a fundamental involution as an absolute, for example: 

W = HZH, (14) 

where i/ = [1,1, - • • , 1, —1】. The arithmetic distance between P and 3(P) is either 0 
or 2. In case P — J{P) we say that P is a point-matrix. 

Theorem 6 A continuous mapping carrying point-matrices into point-matrices, 


non-point-matrices into non-point-matrices and keeping arithmetic distance invariant 
is a Mobius mapping or an anti-Mobius mapping. 

Proof We take 3 in the form of (14), which carries matrices of the form 

{ Z \ ：) ( 15 ) 
into themselves. Thus the mappings under consideration carry the set defined by (15) 
onto itself. 

Without loss of generality we may assume that the mappings keep 0 and oo 
invariant; by Theorem 4, it takes either the form 
W = AZA' 


For the first case, we put 
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7. Manifold at infinity 

Let M be an idempotent symmetric matrix, that is, M 2 = M. Then I — M is 
also an idempotent symmetric matrix, since (J — M) 2 — I - 2M + M 2 = I — M. 
Further M and I ~ M annihilate each other, namely 

M(I -M) = (J- M)M = 0. 


From an idempotent symmetric matrix M, we can construct a symplectic trans¬ 
formation 

W = (MZ +(/- Af))(-( / - M)Z + M)~ l . 

Now we consider those transformations with diagonal M. They are 2 n in number, 
including the identity. The 2 n — 1 non-identity transformations are called the funda¬ 
mental semi-involutions of the space of symmetric matrices. They are of fundamental 
importance in the study of the extended space. 

For the sake of later use, we give the explicit expression of the semi-involution 


with 


Let 


we have 


Theorem 7 


M 


/W 


0 


\ Z 12 Z 22 / 

W= ( Zll ~ ZuZ 22 Z 'l 2 -Z12Z22 \ ) 

\ ~ Z 22 Z 12 ~ Z 22 j 

Every point at infinity can be carried into a finite point by one of 


the2 n — 1 semi-involutions. 


Proof Suppose that 

(Zl,Z2) 

is the homogeneous coordinate of a point at infinity, with Zi of rank r. There exist 
two permutation matrices® P and Q such that 


PZ 2 Q 


^ * 


① Corresponding to a permutation 

1 2 … n 

we have a permutation matrix which is the matrix of the linear transformation x' p = Zj p (1 《 p 彡 n). 
Evidently, we have P' — P~ l . 
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where is nonsingular. We have a nonsingular R such that 

㈣ 


RP{Z u Z 2 ) 


Q'- 1 0 
、 0 Q / 


=(^ ，恥 ). 


Z\\ Zi2 

0 0 


We have immediately 


and 


W 2 T = 


.f W n W 12 
' V W22, 

'I -Z^z 12 、 

,0 J , 

Zu 0\ 

0 0 ) 


WiT '- 1 


t 0 W 2 ,) 

The zero at the left lower corner is obtained because (VTi, is a symmetric pair. 
Then M ^2 is nonsingular, since (Wi ， W 2 ) is a nonsingular pair. 

Let 

Af = 


’ /( r > 0 

00 , 


Then 


{W U W 2 )\ 


-M 


-(/ -M) M 
Pi = Wi (/ -M) + W 2 M 


0 W 12 
0 W 22 , 


= {Pl,P2), 


Zu Z12 

0 0 


which is evidently nonsingular. 
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Now we consider 

M I-M \ / Q 1 0 \ f Mi /-M x \ 

卜 Q 八 -U-M) M 八 0 Q f ) = \ -(I- Mx) M, J ， 

Mi=QMQ~\ 

which is a semi-involution, since Q' = Q _1 . The theorem is therefore established. 
Since 

d(-(I - M)Z + M) (17) 

is a principal minor of Z, we have the following theorem. 

Theorem 8 The manifold at infinity is carried by the 2 n — 1 semi-involutions 
into manifolds defined by equating the principal minors of Z to zero. 

8. Group of analytic automorphs of the space 

Now we come to the second part of the paper. The projective space of symmetric 
matrices may be regarded as an extended space of several complex variables as defined 
by Osgood. The aim of the present section is to establish the following theorem. 

Theorem 9 An analytic mapping carrying the extended space onto itself is o 
symplectic mapping. 

Proof (1) Let 

W = T{Z) (18) 

be an analytic mapping carrying the projective space of symmetric matrices onto 
itself. By a theorem due to Osgood ①， the mapping is birational; consequently, (18) 
may be written as 

Wij = Pij{Z)/q(Z), (19) 

where Pij(Z) and q(Z) are 2 _1 n(n + 1) + 1 polynomials without common divisor. 

(2) There is a point S at which 

Pij(S) ^ 0, q{S) ^ 0. 

The transformation Z = + S carries (18) into a new one with 

Pi/0) / ◦ ， 抑 ） # 0. (20) 

The transformation 


(D Ibid, p.295. 


W = ri-Zi 1 ) 


( 21 ) 
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maps also the space onto itself, and 

Wij= Pa(-Zr 1 )(d(Z 1 )) x /q(-Zr 1 )(d(Z l ))\ 
where 入 is the least integer making all numerators and denominators integral. By (20), 
Ptj(—Zi i )(d(Zi)) >l and q(—Z{ l )(d(Zi)) x are of degree nX. Consider the Jacobian of 
(21). Let A and Ai be the inverses of the Jacobians of (18) and (21) respectively. 
Notice that A and Ai are polynomials, for otherwise there would exist some point 
for which the Jacobians vanish. Then we have 

△ 你 ) =△(-z 1 - i )(d(z 1 )r +i , 

since the Jacobian of Z = —Zr 1 is (d(Zi)) _ ( n+1 ). Since ^(0) 0, we have △ ⑼ 〆 0. 

Consequently Ai(Zi) is a polynomial of degree n(n + 1). 

Without loss of generality, we may now assume that pij and q are polynomials 
of degree n\ and that the inverse of the Jacobian of (18) is a polynomial of degree 
n(n + 1) and its term of highest degree is equal to a constant multiple of (d(Z)) n+1 . 
The highest terms of and q are also constant multiples of (d(Z)) A . 

(3) We decompose q into irreducible factors 

Q = Q \ 1 ( 22 ) 

Now we are going to establish that (qi - - qi) n+l divide A. By some easy transfor¬ 
mation we may suppose that q\ does not divide p nn . We consider the product of the 
transformation (16) with r = n- 1 and (18). The inverse of the Jacobian of the new 
mapping is equal to 

(p nn /g) n+ 1 A, 

which is a polynomial. Thus g" +1 divides △_ Therefore we have the assertion. 

(4) Since d(Z) is irreducible and 

A(Z)= c(d(Z)) n+1 + •••, 
we have immediately that l = 1. Therefore 

q(Z) = (ad(Z) + _••)' 

Further (16) with r = n - 1 carries p nn into the denominator, then we have also 
Pnn(Z) = (a nn d(Z) + 

The mapping cannot be one-to-one except when A = 1. 

Now we may assume that 
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g(Z) = ad(Z) + ■■■ , (23) 

p ij (Z) = a ij d(Z) + ■■- (24) 

and 

A(Z) = c(q(Z)) n+1 . (25) 

(5) Now we shall prove that puPjj — Pij is divisible by q. In fact, by (16) with 
r = n — 2, we have 



If q does not divide p n -in-iPnn - Pnn-i> the manifold g = 0 is mapped into a 
manifold of dimension not greater than n(n + 1) — 3, which is impossible. Similarly, 
every three-rowed principal minor of (p^) is divisible by q. 

(6) Since 

A(-Z- 1 )(d(Z)) n+1 = Ai(^), 

we have 

q(-Z- l )(d{Z)) = gi {Z), 

where qi (Z) is a polynomial. By means of (16) with r = n — 1, we may find that 
pu(-Z- l )d{Z) 

is also a polynomial. Further, by means of (16) with r = n — 2, we find that 

p^-z-^-z-^-pK-z-^) 

is also a polynomial. Thus 

p^-z^mz )) 2 

is a polynomial, and so is pij(—Z~ 1 )d{Z). Therefore we have to find the polynomial 
p(Z) of degree n such that 

p(-Z~ l )d(Z) 


is a polynomial. 

(7) The answer to the question raised in (6) is that if we put 

p(Z) = J2p^(Z), 
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where p( fc ) is a homogeneous polynomial of degree k, then p( fc ) (Z) is a linear combi¬ 
nation of the fc-rowed minors of Z. This will be proved in the next section, owing to 
its independent interest. 

(8) Now we have, instead of (23) and (24), the following expressions 

ti—l 

q(.Z) = ad(Z) + y^g (fc) , a 5^0, 

fc =0 

n—1 

Pij (Z) = aij d(Z) + J2P^' 

fc =0 

where g ⑻ and are linear combinations of the fc-rowed minors of Z. We may let 

a = 1. Let be the cofactor of Zij, then 

? (n-i) = E 

There exists 5 such that 

q(Z - S) 

contains no term of order n - 1. Thus, we may assume that g (n_1) = 0. 

Further the transformation 

X = W- (a 0 ) 

carries (18) into a new one with Ojj = 0. Up to the present, (18) takes the form 


n-2 



q(Z)=d(Z) + ， £q^ 

k=0 

(26) 

and 

n—1 



Pii( z ) = ^Pij^ 

(27) 


fc=o 

where and pg) are linear combinations of the fc-rowed minors of Z. A is given 


by (25). 

Since the singular matrices form a manifold of dimension n(n + 1) — 2, there is a 
nonsingular matrix Z carried into a nonsingular matrix W. Without lose of generality 
we may assume that 

i = r(/). (28) 

pir 1)= E 叫，， <， 


(9) We write 


(29) 
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We shall prove that (29) forms a system of independent equations. In fat 
of the Jacobian of (18) with (26) and (27) is, by direct verification, not 

(_ 2 ) n(n 广 1) _ j _ _ n („ + 1 )- 1 . 

the last “ 一 1™ appears, as the highest terms are dependent. Then A is of degree 
less than n(n + 1) + 1, which is impossible. We have therefore the assertion. 


*)) = 5Z ^'at = 


consequence that d(Z) divides 


I p!r” n _1) I 


1) n (n-l) 叫 

,)To - d pT 1 ) 戍 _” 1 


Ur 1 ) 戍 _1) pir” 


0, we find that (30) is of rank not greater than 1. In partic 
An are of rank 1, as that A„ cannot be of rank 0 has been ! 


An = (««,••• ,ai n )'{an, ■■- ,ai„). 


By (28), we have 


Thus AY{Z)A' carries (30) into a new one with 

乂《 = [0，...，0，1，0，-. ，0] 

(where there are i — 1 zeros preceding the one). 


Pu ^ = z u + 5 Z P22 ” = A + 

1 . 



Since d(Z) divides 

nu-wr 1 )) 2 ， 

we write 

pn 1 ) - (Mr 1 )) 2 = d(Z)g(Z). 

By the lemma given in the next section, Pu -1 ^P 22 _1 ^ — (Pi 2 _1 ^) 2 ^ a lin ear combination 
of the two-rowed minors of 

( 4 ). 

We write 

Pi2 _1) = S hi i z ii- 

If P 12 contains (i,j ^ 1,2) then P 11 P 22 must contain x^Xjj, which is impossible. 
Therefore 

” = 士 

Consequently, 

?11 -1) ?22 _1) - (P ( i2 _1) ) 2 = *11 Y1 ^*3 + z ii S + 
is a multiple of d(Z). This is possible only when c = a = 0. Thus we have 
Pi" _1) = 4. P<" -1) = ±«ij- 

The transformation 

【 1,-1 ， 1, … ， 1]Z*[1,-1, ••• ， 1] 
carries z^ 2 into —z^, therefore we may assume that 

Since 

z \\ z 12 z 13 

z 12 ^22 一 2 h 

z 13 —z 23 z 33 

is not divisible by d(Z), we have 


^ = 4 - 
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Up to the present, we arrive at the conclusion that we may let 
n-2 

9 (Z)=d(Z) + ^ 9 W (31) 

fc =0 

and 

n—2 

Pij(z) = z-j + Pi^- ( 32 ) 

fc =0 

(11) We write 

PnP22 - P12 = 0 (mod g) 

in a more precise form 

PnP22~Pi2 = 9 ^. ( 33 ) 

where 

xj)- V> (n-2) + V» (n-3) + ••• 
and ^( fc ) is a homogeneous polynomial of degree k. 

Comparing the terms of degree 2n — 3 in (33), we have 

^11P^" 2) + ^2P ( n' 2) - 2z 1 >ir 2) = d(Z)^ n ~ 3 l 


By the result which will be proved in (12) we have 

P Sr 2) =pir 2) =pir 2) =^ (n_8) =o. 

Now we are going to prove that P!?) ， pHp 这 ） and are all zero. Suppose that 

the assertion is true for a > p. Then 

z* n p^ + _ 2^ = d(Z)，- 1 ). 

By (12) we have p ( ,^ = p[^ = = 矽 (<7_1 〉= 0. Thus we have finally 


q(Z) = d(Z), Pij{Z) = z'j. 

The theorem is proved completely, except the verification of the following assertion. 
(12) For n — 2 > a > 0, from 

zl lP ^ + z 2 * 2 P^- 2^> = d(Z)^\ 

we can deduce that = ^ _1) = 0. 
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To prove this assertion, we make a transformation Z = W -1 . Then 
4= 如 P^\Z) = r^-°\w)/d(W), 

= (/> {n - a+1) (W)/d{W), 

we have 

U, n r 22 -<T) + ^72^~ 0) - 2wnr[2~ a) = 

where 沴 ( T ) is a sum of t = (n — <r + l)-rowed minors. Let 沴 i be a T-rowed minor 
contained in <j>. Putting all elements of W other than those contained in <j) equal to 
zero, we find that </>i, a determinant of order r, may be expressed as 

VJ12T22 + W22TU - 

where ru, r 12 , r 22 are sums of (r - 1)-rowed minors of <(n. This is impossible for 
n > 3 as shown by the fact that the determinant vanishes identically for Wii = W \2 = 
W 22 = 0. Thus we suppose that r = 3. Since 1022 contains no term with factors w\i 
and W13, we have 

»*22 = Cx{W22VJ33 - V>\z) 



ru = C2(wnW33 - taf 3 ). 



C\(W22W33 - W23)wu + C2(wnWi3 - « ； i 3 )t^22 - 2^12^12 
=t0n« ； 22^33 - ^23^11 ~ ^13^22 + • • • 

implies 

Cl(1022^33 - W2 3 )Wu + « ： 2(«»11 切 33 一《 »13) 2 叻 23 
= W»lltt>22t«33 — 1^23^11 ~ W \3 W 22 (mod« ； i2 )， 
which is evidently impossible. 

9. A result concerning adjugate 

Now we are going to verify the assertion stated in (7) of the previous section. 
Let 

•X" = (Xij) 

denote an n-rowed symmetric matrix and let 

y = (y«) 
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be its in verse. Now we consider the following problem: find the polynomial f(Y) in 
yij satisfying 

f(Y)d(X) = g(X), (34) 

where 夕 (X) is a polynomial in Xij. Since d(X) is a homogeneous polynomial, we need 
only consider the homogeneous part of f(Y) as well as p(X). Thus our problem is 
reduced to finding homogeneous f(Y) satisfying (34). Let l be the degree of f(Y) 
and 

f(X) = p(pii ， … ， PnOyf" •••ySn"- (35) 

pn+ … +Pn»=J 

Then g(X) is of degree n — l and can be written as 

9(X)= 53 9(91 1 ，…， • • a: nn n - (36) 

«l+ … +9nn=»>-t 

Let 

x* = (x：,.) = d{X)Y. 

Then (34) may be written as 

X] P(Pu， …… C" n 

Pll+—+Pnn=fc 

=(d(X)) l ~ l Y 1 ，…， 9-Mi 1 (37) 

9llH ■… +flnn=«-J 

Notice that the relation (37) is a reciprocal one, in fact, from (37)，we deduce 

also 

^2 9 ( 911 ， … ， gnn)®!? 11 - - - <? T n 

911+ … +9nn=»>—1 

= (d(X))"- i - 1 P(Pll ， --- ， PnnKt 1 --<" n "- (38) 

Pll+--+Pnn=»> 

Does (37) have a solution? It is known ① that a minor of order l satisfies our 
requirement. The purpose of the present section is to establish the converse, namely ： 
Theorem 10 (37) holds if and only if (36) is a linear combination of the (n-l)- 
rowed minors. Consequently, (34) holds if and only if g(X) is a linear combination 
of the minors of X. 

Proof The theorem is evidently true for Z = 0, l,n and n — l. Consequently 
the theorem is true for n = 1,2, and 3. 

① Wedderburn. Lectures on matrices. Amer. Math. Soc. Colloquium Publications, vol. 17, 
p.67, formula (20). 
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Let n ^ 4 and n/2 ^ ^ n — 2. We write 

g(X) = gi (X) + g 0 (.X), 

where gi(JV) vanishes for a：n = x \2 = … = xi n = 0. Putting 



we find, by comparing the homogeneous parts, that 如(义） = 如 (Xi) satisfies (37) with 
Z — l,n — 1 instead of l,n. By hypothesis of induction, 卯 (X) is a linear combination 
of (n — I)-rowed minors of Xi. 

Let 

9i(X) = 92(X) + 5i,oPO ， 

where gt{X) vanishes for 

a：n = Xl2 = • • • = xi n = 0 

and for 

Xl2 = *22 = ••• = l2n = 0- 

Then 510 (-^) is also a linear combination of the (n — Z)-rowed minors of X. 
Proceeding successively, we have 

g(X) = go(X) + gio[X) + • • • + gno(X) + V*(^0 ， 
where ip(,X) vanishes for 

X11 = X12 = ••• = *in = 0 > 
for 

X 12 =X22 = - = X2n = 0. 


and for 

Xin = X2n = … = X nn = 0. 

Since the degree of i>(X) is n -1 ^ n/2, it is only possible for l = n/2, and 
contains only terms of the form 

0^12*34 • • ' ®n-ln- 

The term (in ••- x„ Il ) J-1 (a；i 2 a ：34 - - - *n-i«) cannot appear on the left side of (37). 
Therefore we have the theorem. 
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of the same signature as H form the elliptic space with signature {p, q). The symplectic 
transformation 

(Z,,Z 2 ) = Q(Wi, W 2 )F, F^F r = $ 
is called a motion of the space. 

Theorem 11 Any nonsingular symmetric pair of matrices (Wi, W2) making 
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From Theorem 11， we may easily find that 

ddWuWi^OV, ， W 2 )'H)^0 (41) 

for any nonsingular pair of matrices (Wi ， W 2 ) and that the equality holds on a mani¬ 
fold of dimension not greater than n(n + 1) — 2. Now we shall go further to establish 
the following theorem. 

Theorem 12 If 

d((W^Wi)W，W 2 )0 = 0, 

then (W\, W 2 ) lies on a manifold of dimension not greater than n(n +1)—3. 

Proof By the consideration of semi-involutions, we have only to consider the 
nonhomogeneous expression, that is, we are going to prove that the symmetric ma¬ 
trices Z satisfying 

d(H + ZHZ) = 0 (42) 

form a manifold of dimension not greater than n(ra + 1) — 3. 

The equation (42) implies that HZ~ l HZ~ l has at least a negative root, if ^ is 
nonsingular. By a theorem due to the author ① those Z form a manifold of dimension 
not greater than n(n + 1)—3. 

The equation d(Z) = 0 does not imply d(H + ZHE) = 0 identically. The 
manifold defined by 

d(Z) = d(H + ZHZ) = 0 

is therefore of dimension not greater than n(n + 1)-3. 

Theorem 13 (Fundamental theorem of the elliptic space) An analytic auto- 
morph oj the elliptic space of signature (p,q) is a motion of the space. 

Proof Let 

w = r(Z), r = (fa) 

by an analytic mapping of the elliptic space onto itself. Then fij{Z) is analytic in 
the whole extended space of symmetric matrices except possibly on a manifold of 
dimension not greater than n(n + 1)-3. By the continuity theorem® of functions of 
several complex variables, fij(Z) is analytic in the whole extended space. Its inverse 
mapping is also analytic everywhere. The theorem follows from Theorem 12. 

① Ibid. Theorem 6. 

② Levi, ibid., or Sate 17, Folgerung 1 of Behnke and Thullen. Theorie der FunkUonen mehrerer 
komplexer Verdnderlichen. Julius Springer, 1934. 
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11. Hyperbolic space 

The hyperbolic space is formed by the symmetric matrices Z making I — ZZ 
positive definite. A symplectic mapping carrying the space onto itself is called a 
motion of the space. 

Theorem 14 (Siegel) An analytic mapping carrying the hyperbolic space onto 
itself is a motion of the space. 

For completeness, we give here a proof which is different from that due to Siegel. 

Proof (1) The space is transitive, hence we need only to consider the transfor¬ 
mations keeping 0 invariant. By a theorem for circular regions proved by H. Cartan ①， 
the mapping is a linear one. Let 


W= AraZr »' Ara = A， rs (43) 

be the mapping. Since I — ZZ = 0 is the intersection of all the algebraic surfaces 
bounding the space, the mapping (43)，therefore, carries unitary symmetric matrices 
into unitary symmetric matrices. 

Since any unitary symmetric matrix can be expressed as UU' where U is unitary, 
we may assume that (43) carries I into itself. Consequently, we have 

^ Arr = I - ( 44 ) 

r=l 


Putting Z = ，…， e*®"], we have 

I = WW= ) (^Arre 

\r=l / \r=l 

for any real 0. Consequently we obtain 

ArrAat = 0 for T ^ S. 


(45) 


(2) From (44) and (45), we have 


A rr —* Arr 


[P")- 


ArrArr = | 〉: A aa j A rr = 3 rr _ 


Then A rr are real and A TT A sa =0,i4? r = A^. Therefore, we have a real orthogonal 
T such that 

- - , 0 , 1 , 0 , -•• , 0 ] 


① Journal de MathSmatigue, 1931, 10(9), Theorem 6. 



Geometries of matrices. III. Fundamental theorems in the geometries of 


149 


(where the 1 in the brackets is in the rth place). Without loss of generality we assume 
that 


A rr = [()，•• - ，0,1，0, …， 0] 


(46) 


(where the 1 in brackets is in the rth place). 


(3) Now we put 



e 访 
0 


J+[l, 


From (44) and I = WW, we have 


A\iA\2 + e t9 Ai2{I — An — An) + (/ — -^ii — A22)^~ x0 M2 
+(( / — An — An ) 2 — I, 


for all 9. Thus we have 


where aa = J( 2 ). 

(4) Further we let 


A\2 = a( 2 ) + 0， 


(e^/2^ 1/2^ ) 

!/2 1/2 - e - iT /2 " 2 厂 

From (44) and I - WW = 0, we deduce immediately that 

—((： 二; nr : 二 ： h 

= 2 -w + ( 了二 )+ ( :二)如 -w. 


Consequently 


e~ iT 0 
0 -e iT 


H 


e iT 0 
0 -e~* r 


Similarly A ra is determined completely apart from a sign. We may assume that A u 
takes the positive sign. Since 
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being unitary does not imply that 

( 2ll ^12 Z13 \ 

«X2 «22 —^23 

213 —223 Z 33 J 

is unitary, the theorem is now established. 

Finally, the author gives the following theorem which shows the importance of 
the notion of the characteristic roots of the distance-matrix. 

Theorem 15 A mapping carrying the hyperbolic space U I — ZZ being positive 
definite” into itself, and keeping the characteristic roots of the distance matrix between 
two points invariant, is either a hyperbolic motion or a hyperbolic motion combined 
with a reflexion Z = W. 

Proof The proof is comparatively simple, and it contains some repetition of 
our old argument. The author gives only the main procedure of the proof. 

(1) The distance matrix 33(A,B) of two points A and B of the space is defined 
by 

V(A,B) = 

In particular for S = 0, we have 

H{A,0) = AX. 

(2) Owing to the transitivity, we need only consider the mapping W = T(Z) 
keeping 0 invariant. Thus both Hermitian matrices WW and ZZ are conjunctive. In 
particular the mapping keeps the rank of Z invariant. 

(3) Let 

r(ll/2,0, •• - ,0]) = (oi, ， On)’(ai ， ". ， a„). 

By a slight modification, we may let 

r([l/2,0 ， .. ， 0]) = [l/2,0 ， ". ， 0l. 

Prom the invariance of the characteristic roots of distance-matrix we find 
rdALO,- - ,0]) = [A 1 ,0, -- ,0], 

for all real 入 i. Without loss of generality, we may modify T such that 

r([o, a 2 ,0, • • •, 0]) = [0, a 2 , 0, • • •, o], 


r ([ o , o , …， 0 ，入„]) = [ 0 , 0 ，一， 0 ，入„] 
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for all real 入 2 ,…，入 n- 

Next, we may show that for any real diagonal A we have 

r ( A ) = a 

and that for all real X, we have 

r(X) = X. 

(4) Since W = T(Z) and 

r(X,W) = r(X,Z), 

we find that 

\d(X- W)\ a = \d(X - Z)\ 2 
for all real X. We deduce that 

W = Z or Z. 



Some “Anzahl” theorems for groups 
of prime power orders* 

Abstract 

We define a group Q of order p n (p being a prime) to be of rank a if the maximum 
of the orders of all elements of Q is equal to p n ~ a . First the author introduces a 
“pseudo base”，which is the main weapon of the paper. Then the author proves the 
“Anzahl” theorem that if p > 3,n > 2a + 1, then (i) Q contains one and only one 
subgroup of order p m (2a + l ^ m ^ n) of rank a; (ii) Q contains p a cyclic subgroups 
of order (a < m < n - o + 1); and (iii) Q contains p m+Q elements satisfying 
GP m = 1. Next the author determines completely the ranks of all major subgroups 
and the exact number of the major subgroups of each rank. More precisely, the rank 
of a major subgroup of index is cither a — 0 + 1 ot a — /3, and for p 彡 3 and 
n ^ 2a + 1, the number of those of rank a- 0 + 1 and the number of those of rank 
a - 0 ase equal to and P 0 小 d-1,0 respectively, where d is the number of 

mutually independent generators of the group and for 0 < r ^ 1 /, 

(p^-1)- 

知 T _ (pr_l)...(p_l) - 


Then the author deduces a more precise formula than the enumeration principle due 
to Hall. This is the second weapon of the paper. By these two weapons, the following 
Anzahl theorem is obtained: iip 彡 3 and n 彡 2a+l, then the number of its subgroups 
of order p m ( 2 a - P < m <n - 0 + 1 ) and of rank a -0 with 0 < 卢彡 d is congruent 
to (mod p d ~ 0+l ). For 0 = d, p^<f>d-i ,0 is to be replaced by zero. In the 

later part of the paper, the author selects the following typical one： for p > 3 if ^ 
contains jp cyclic subgroups of order for a fixed m satisfying m > /? + 1, it is of 
rank 0 . 


Introduction 


For convenience, we shall define a group Q of order p n (p being a prime) to be 
of rank 5 if the maximum of the orders of all elements G of 0 is equal to p" s . 

* Reprinted from The Science Reports of National Tsing Hua University, Series A, 1947, 
4 (4-6 )： 3X3-327. 
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Throughout this paper, we shall always denote by p an odd prime. 

In §1, by introducing a ^pseudo-base", we shall prove the following “Anzahl” 
theorem: 

If Q is of order p™ (p > 3, n ^ 2 q + 1) of rank a, then (i) Q contains one and 
only one subgroup of order p m (2a + 1 ^ m ^ n) of rank a; (ii) Q contains p a cyclic 
subgroups of order p m (a < m < n — a + 1); and (iii) Q contains p m+a elements 
satisfying 

= 1 . 

The second and the third statements of the above theorem yield better results 
than the well-known theorems due to Miller and Kulakoff respectively. 

In §2, the ranks of all the major subgroups and the exact number of the major 
subgroups of each rank will be determined completely; more precisely, in a group Q 
of order p n of rank a, the rank of a major subgroup of index p 0 is either a- 0+1 
or a- 0, and for p ^ 3 and 2a + 1, the number of those of rank a —(0 + 1 and 
the number of those of rank a - /? are equal to and respectively, 

where d is the number of generators of the group and for 0 ^ r < i/, 

_ 

队 ， T = ~(pr _ 1) . . . (p _ 1 ) 一 

By means of these results we shall establish a more precise formula than the enumer¬ 
ation principle due to Hall. 

Some precise results concerning the number of subgroups of order p m in Q will 
be given in §3, the chief theorem is the following: 

Let ^ be a group of order p n and of rank a with p 彡 3 and n ^ 2a + 1 • Then 
the number of its subgroups of order p m (2a — P<m<n — P + 1) and of rank a —/? 
with 0 < /3 ^ d is congruent to p^^d-i.p (mod p d ~ 0+l ). For 0 = d, p^iffd-up has to 
be replaced by 0. 

Among the theorems of §4, we shall state the following typical one which is a 
generalization of a result due to Miller and is in a certain sense a converse of a result 

of§l： 

A group of order p n (p ^ 3) which contains p 0 cyclic subgroups of order p m for 
a fixed m satisfying m> 0 + 1 is of rank 

For )8 = 0, the theorem is trivial, and for p = 1, vre have the case discussed by 
Miller. 

The author is indebted to Mr. H. F. Tuan for his revision of the manuscripts. 
Some new results of Kulakoff’s type have been obtained by him. 
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；1. Theorem 1 Let Q be a group of order p n and of rank a vdth p ^ 3 
n ^ 2a + 1, then we have the following conclusions: 

(i) There exist a + 1 elements A,A\, ■■- ,A a of Q such that every element G of 
Q can be expressed uniquely as 

G = A^-A^A X , 1<A 0 <?,••• 

where the order of does not exceed p s and that of A is exactly p n ~ a (pseudo-base)) 

(ii) The equality 

{BMp 01 = BfBf 

holds for any pair of elements Bi and B2 of Q\ 

(iii) The commutators of Q are of order < p a , and the simple 3-fold commutators 
of Q are of order < p a_1 ; and 

(iv) A p ° belongs to the central of Q. 

Proof For ar = 0 the theorem is trivial. For a = 1, the theorem is true®. 
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where C belengs to 5JJI, since is normal in Q, By (ii')，we have 

Further, C p ° 1 belongs to the cyclic subgroup {^4}, for otherwise the group {C,A} 
would be greater than 9JI. We can let 

since the order of C cannot exceed that of A. Then 

B-U^B = 


Consequently by (iv 7 ), 

AP a ~' = B- p A pa ~'B p = A pa ~' il+b)P . 


Therefore 

(1 + 6) p = 1 (mod p n - 2a+l ). 

(1) 

For n > 2a + 1, we have then 



6 = 0 (mod p n-2a ). 

(2) 

Thus 

c^° _, = ，… 1 c . 



i.e. (A, B) = C is of order < p°, a4id it follows that (B,A P ) is of order < p a_1 . We 
then obtain (iv) by (B, A p ° ) = 1 and (iV). 

Further, the orders of the commutators (B, Ai), i = 1,••• ,a - 1, axe < p 0-1 , 
since let 

A^B^AiB = Q {Ci in OH), 

we have 

1 = B~ 1 Af' 1 B = (A i Ci) pa ~ 1 = 

by (ii’〉and (i’). Let M be any element of OT, and let M = M\A X with M\ = 
■^a-V ''' 1 - Then since 

= MU X ) = (B, A X )A~ X (B, MM X , 

we have 

(B, M) pa = (B,A x ) pa A- pax (B,M 1 ) pa A pt，x = 1. 
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Further, if both B and B' are in Q but not in 9JI, then we have an element M of 971 
such that B' = B^M. Therefore 

(B, B') = B~ l M^B~ x BB x M = {B, M) 

is of order < p a . 

Now let K be any commutator, then K belongs to Wl and is of order < p a . Thus 
(K, C) is of order < p Q_1 for any C oi9Jl by ( 故 ). Finally we can write 

K = M 1 AP n ~ aay with Mx = … 冲 . 

Then, for any B oiQ not of 5301, we have 

(K,B) = 

By (ii’）and since (Af, B) p ° * = 1, we have 

(K, B) pa ~ l = 1, for n^2a + l. 

Hence the proof of (iii) is complete. 

It is easy to see by mathematical induction that 

where among these K's |p(p — 1) are (Bi, B 2 ) and the others are higher fold com¬ 
mutators. By (ii’）we then have for p 彡 3 

{B x B 2 Y a = B\ a Bl a Kf'' -KC 1 = «( 执，丑 2 々。 ( 卜 《 = BfBf. 

Therefore (ii) is true. 

If B is of order ^ p a , then B = A a satisfies our requirement of (i). If B is of 
order {0 > a), then 

B p = A^Ti - - • AP "_ a_fl+ 〜 with p + fi. 

Therefore 

〆 = A^n 

by (ii，）and (i，). Let _4 a = B~ 1 A pn ' a ~ l> (i, then 



orders of i4 a s'. 

Remark For n = 2a, (1) implies 6 = 0 (mod p). The theorem is still ture. 
Theorem 2 (Generalization of a result due to Kulakoff) The number of ele¬ 
ments of order < p m (a ^ m ^ n - a) of a group of order p n (p ^ 3, n ^ 2a + 1) of 
rank a is equal to p m+a . 

Proof By Theorem 1 (i) and (ii), the elements of order < p m are of the form 
- ■■A^A pn ~ a ~ n ' i , - , l^Ax^p, 1 《 M < P m , 


There are p m+a such elements. 

Theorem 3 (Generalization of a result due to Miller) The number of cyclic 
subgroups of order p m (a < m < n-a+1) of a group of order p n (p ^ 3,n > 2a + l) 
of rank a is equal to p Q . 

Proof The number of elements of order p m of the group Q is p a <p(p m ). There¬ 
fore we have the theorem. 

Theorem 4 The group of order p n (p ^ 3, n ^ 2 q+ 1) of rank a contains one 
and only one subgroup of order p m (n > m ^ 2a) of rank a. 

Proof The set of elements 

M = A^- -A^AP y , 0 ^ A a ^ p-1, • • • ,0 < Ai ^ p-1, 0 ^ A 7 ^ p^^-l (3) 

forms a group, since (Af, A p ) is of order < p 0_1 , i.e (M,A P ) = MiA Xp with M\ = 
4 二 l • • Further (3) contains ail elements of order < p n ~ a ~ l . The theorem is 

then proved by induction. 

Remark For n = 2a, Theorem 4 is then trivial. 

§2. Let Q be any p-group of order p n and of rank a and let 931^ denote a typical 
major subgroup of index jP m Q with 0 ^ ^ d = d(Q), where d is the number of 

mutually independent generating elements of Q. In particular, Q = 971 0 , D = tXHd = 
the principal subgroup of Q. 

Theorem 5 If Q is of mnk a, then the rank of Jf, which is a subgroup of Q 
of index p^, is^ a — 0 and ^ a. 
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2) Evidently belongs to 3^ for all elements G of Q. If 3V is of rank 7 , then 






Therefore n — a < n — 7 and 7 < a. 

Theorem 6 If Q is of rank a, then the rank of any is ^ a — 0 + 1. 

Proof Suppose 9% is of rank 7 . Then 

( G P)P n - s - r = 1 

for all elements G of Q, since G p belongs to Therefore n, — /3 — 7 + 1 ^ n — a 
and 7 < a — 冷 + 1 . 

According to Theorems 5 and 6 , we can classify the major subgroups Tig into 
two classes. 

I. Those 9% of rank a — /? + 1; the typical one will be denoted by 9%: ; 

II. Those 9Jlff of rank a- 0; the typical one will be denoted by 

Theorem 4 asserts that there is one and only one provided p > 3, n > 2a+l. 
Theorem 7 Let p 彡 3 and n ^ 2a+ 1. The subgroups of Q which lie between 
QTli.i and & are those of I and conversely. 

Proof If Tip lies in which is of rank q, the rank of Tig is > a — (/3 —1)= 
a —(3 + 1, by Theorem 5. By Theorem 6 , the rank of VJlp is equal to a — /3 +1. Hence 
3% lies always in I. 

Next we shall prove that every Wlff, lies between 3Jln and This is trivial for 
/3 = 1. Hence we suppose P ^ 2. There exists an which contains SOl/ 3 , 1 . If 

belongs to I, i.e. 97l^_i = the theorem follows by induction. Hence 

suppose that £01 ^-1 belongs to II and we write 9 Jl/ 3 _i = ^ 0 - 1 , 2 - The intersection 
of QTli.i and 3% 一 1,2 is of order p n ~ 0 and of rank a — 0 + 1, since 1,2 is not a 
subgroup of by the direct part of the present theorem and 
(order of aHi.i)(order of ^- 1 , 2 ) = order of g 
order of OTi.i A 

OT p .- l , p n- g+ l 广 

order of ®li，i 八 £ 01 / 5 - 1,2 

Now £ 010 - 1,2 of order p n_/3+l and rank a - (0 - 1), has one and only one subgroup 
of rank a — 0 + 1 and of order p n ~ 0 by Theorem 4forn — /3 + l>2a — ,5 + 2> 
2(a — /3 + 1) + 冷一 1> 2(a —+ 1) + 1, the intersection should be SJJT/s, 1 . Thus 


OTi,i A OT^-1,2 = 9%i- 
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Consequently is a subgroup of The theorem is thus completely proved. 

Corollary 7.1 Ifp ^ 3 and n ^ 2a +1, then the principal subgroup of a group 
of order p n and of rank a is of rank a — d + 1 . 

Corollary 7.2 Ifp > 3 and n > 2a +1, then a group of order p n with a cyclic 
principal subgroup of index is of rank a = d — 1. In other words, the number of 
members of a minimal base is equal to its rank plus one. 

Corollary 7.3 Ifp ^ 3 and n ^ 2 q + 1, the number of elements of a minimal 
base of a group of order p n and of rank a does not exceed a + 1 . 

Theorem 8 Let p ^ Z and n ^ 2a + 1. The number of is equal to 
tpd-1,0-1 and that is equal to 

Proof The number of is equal to the number of subgroups of order p n ~ 0 
which lie between 田 li,i and Now Abelian of type (1, ■ - , 1), we have 

d-l 

the first statement of the theorem. The second follows immediately since 
<t>d,p - <t>d-i,p-\ =P^<t>d-\,0- 
Theorem 9 Let p ^ 3 and n > 2a + 1. Then 
d{m 0 , l )^d{G)-l3 + l 

and 

d{m pa ) ^ d(g) - a. 


Proof It is evident that 9{Q) > hence we have the inequality 


d(m 0 ) ^ d{g) - 0. 


In particular, the last inequality is trivial and is true for arbitrary n. To prove the 
first inequality, we have merely to show that for n > 2 a + 1 , 


For suppose 9{Q) - 资 (OTi.i), then Corollary 7.1 shows that 迓 (OTi.i) would be of 
rank a-(d-l) + l= a-d + 2, since ra - 1 > 2a + 1 forn > 2a + 1. But the rank 
of 3>{g) is a-d+1. Thus ^{G) # ^(OTi.i), and the theorem is proved. 

Corollary 9.1 Let Q be a group of order p n and of rank a with p > 3 and 
n > 2a+l. IJQ has a cyclic principal subgroup of indexp d , then each major subgroup 
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has also a cyclic principal subgroup, and for n > 2a + 1, there is one and only 
one maximal subgroup which has also a cyclic principal subgroup of index p d . 

This is an immediate consequence of Corollaries 7.2 and 7.3 together with Theo¬ 
rem 9. The converse of the corollary is not true as is shown by the following “Gegen- 
beispiel ”： 

i4f nS = 1, = 1, (A it A2) p = 1, ((i4 1 ， _ < 4 2 ) ， i4i) = 1 ， ((Ai,A 2 ),A 2 ) = 1- 

Theorem 10 (Modified form of Hall's enumeration principle) Letff, 371/3,1 and 
501^,2 have the same meaning as defined above. Let 3? be any class whose members 
are elements or subgroups or (still more generally) sets of elements of G ， and let each 
member of K belong to at least one maximal subgroup of Q. Let denote the 

number of members o/Sl which belong to Then 


n (9 Wo ) - - n (肌， 2)+ P ( ^2 n (9 Jl 2 , i ) + 52 职 2 . 2 ) j 


-p 3 l Y, n ( OT 3,i)- S "(^3, 2 ) + • • • + (.-l) d p^ d ~ l) n^d) = 0, 

\(ffll 3 ,,) (OTa.a) ) 


the sum ^ being taken over all the subgroups Tl0,i of Q. 

The derivation of this theorem from that due to Hall is immediate. 

§3. Theorem 11 Let Q be a group of order p n and of rank q > 1 (consequently 
d > 2), with p 彡 3 and n ^ 2a + 1. Then the number of its subgroups of order 
p m (n > m ^ 2a) and of rank a - 1 is congruent to 

P0j-i,i =p + p 2 + ••• +P d ~ 1 (mod p d ). 

Proof For m = n - 1, the subgroups considered are those of SJJti. 2 , which are 
> n number, hence the theorem is true for m = n—1. In particular, the theorem 
is true for n = 2a+l. We shall therefore assume that n > 2a+l and n — 1 > m 彡 2a 
and apply mathematical induction on n. 

Taking 3? to be the set of subgroups of order p m and of rank a - 1, we have, by 
Theorems 10, 4 and 5 

n(9Wo) = + ^ 2 打 ( 职 i,2)-P ^2 n(OT 2 ， i). 

(®li ， 2) (Wla.i) 

By Theorems 8 and 9 and hypothesis of induction, we have 
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n(fXft 0 ) = • 1 • 1 (mod ^ 叫 . 1 )) 

= P<l>d~i,i (mod p d ). 

Theorem 12 In a group Q of order p n and of rank q (p ^ 3 and n > 2a + 1 )， 
the number of subgroups of order p n ~^ (d^ 0 ^0) and of rank a — 0 is congruent to 
p^<f>d-i,0 (mod p d_,3+1 ). Thus the number of nonmajor subgroups of order p n ~^ and 
of rank a — 0 in Q is divisible by p°~^ +1 (for 0 = d, p^<j)d-i,0 is to be replaced by 0 ). 

Proof There is no subgroup of rank a - 0 of order greater than p n ~ 0 by 
Theorem 5. 

The theorem is trivial for = 0 and it is true for /? = 1 by Theorem 8 . We 
shall therefore assume 0 > l and apply induction on 0. Taking 5? to be the set 
of all subgroups of order p n-/S and of rank a — /?， we have, by Theorems 10 and 
5 (hence OT 7 ,i contains no subgroup of rank (a — 7 + 1) — (/3 — 7 + 1) of order 
> ph-lO-W-l+l) 

n(97lo) = E n(5Bl ]) - p 二 n(37l2) +p a 

(£<«,) (®l a ) (®ls) 

-(-l, + 1 p ⑽ +1> [ n(®l^ +l ) —— 

= n (® l l,a) ~ V 5 Z n ( 职 2,2) + P 3 ^2 n (®^,2) - 

(ORi.j) (OTs.a) 

52 科， 2 ). 

(®w) 

Now © 1^,2 is a group of order p n ~"* and of rank a — 7 , and 71 ( 551 ^, 2 ) is equal to the 
number of subgroups of order ⑺ _7 ) and of rank (a— 7 )—(^- 7 ) with ^—7 < 0, 

further n -7 ^ 2 (a~ 7 ) +1 and 0 < ^ d-^r < ^(£^, 2 ), we have, by hpyothesis 

of induction and by Theorems 8 and 9 


(9R,, 3 ) 

=_ (_l)-r p i7(-r-D ^ (mod/) 

(<5 = - /?-f 7 + 1 + ^7(7 - 1)^ 

= -(一 ^2 P /3_ ， T ^d-T-l,0-7 (modp d ^ +1+ ^^-^) 
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=一 (-1) 7 P= 7(7-1) Y 1 (mod p d ~ 0+1 ) 

(an-,.2) 

三一 (-l) 7 p^ 7(7 "* 1) p 7 ^d-i,iP^ _T ^<t-7-i,/3-7 ( mod P d ~ 0+1 ) 

=~ (~l) 7 p^ 7( ' T "' 1 V^d-l,-y<Ad-7-l,/3-7 ( mod P d ~ P+1 )- 
Consequently 

n(OT 0 ) = jP (j>d-i,i<l>d- 2 ,p-i - P<t>d-i,24>d-3,fi-2 +- 

=P^d-i ，0 (mod p d ~ 0+1 ), 


and the theorem is thus proved. 

Corollary 12.1 Let Q have a cyclic principal subgroup of index p a (conse¬ 
quently it is of rank a = d — 1). Then the number of subgroups of order p n ~^ (d ^ 
/3 > 0) and of rank a — 0 is equal to p^<f>d-i, 0 - 

Proof Since for this case, d{Tly, 2 ) = d - 7 , we have everywhere equalities in 
place of former congruences. 

Theorem 13 Let Q be a group of order p n and of rank a with p 彡 3 and 
n > 2o + l. The number of its subgroups of order p m (2a — 0 < m < n — 0+1) and of 
rank a-/J(0</?<d) is congruent to p^ipd-i.p (mod p d ~ 0+1 ) (for 0 = d, j^<h-i ，0 
has to be replaced by 0). 

Proof Theorem 12 asserts that the theorem is true for m = n — /9. Thus, in 
particular, the theorem is true for n = 2 q + 1. By Theorems 4 and 11, the theorem 
is true for /? = 0 and 0=1 respectively. Hence suppose n > 2 + 1, m < n — 0 and 
0 > 1, and prove by induction on n and 0. Let Si be the set of all subgroups of order 
p m of rank a - 0. By Theorems 8, 9 and 10 and hypothesis of induction, we then 
have as before 


n(!W 0 ) = + n ( m h 2 )~P Y1 n (职 2 ， i)_P E n(9Jl 2 , 2 ) + . 

(SWi.j) (3«a,a) 

= J^{<Pd-l,0 + <l>d-l,l4>d-l,0-l - + ••• 

- (- 1 )-l.T 

-+ •••) 

三 (mod p d ~ 0+l ). 


hence the theorem follows. 

Corollary 13.1 Let Q have a cyclic principal subgroup of index p d {conse¬ 
quently it is of rank a = d — 1). Then the number of its subgroups of order p m with 
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2a — /3<m<n — 0+1 and of rank a — 3 with 0 < < d is equal to p^(l>d-i, 0 - 


Proof Since ^(£^, 1 ) = Q —7 + 2 = d — 7 +I and rf( 9 Jty， 2 ) = rf — 7 , we have 
everywhere equalities in place of the former congruences. 

Corollary 13.2 In a group of order p n and of rank a with p 彡 3 and n ^ 
2a + 1 , the number of subgroups of order p m and of rank a — 0 with 0 ^ 0 ^ oi and 
2a-P<m<n-0 + l is congruent to p 0 (mod p^ +1 ). 


Proof For 0 彡; 0 彡 


d' 

2 j' 


the corollary is true by Theorem 13. Hence we can 


assume that /3 > J and apply induction on m. 
For m = n — 0, we have then 


n(9Jlo) = n(£Wi,i) + ^ 2 n(OTi ， 2 ) -p^- 卩 7»( 职 2,2) + - 
= O + p^j-i.iCp 0-1 + •••) — 0 -p■ p 2 <t>d-2,2(j^~ 2 + •••) + •• 
=〆 (mod p^ +1 ). 


Hence the corollary is true for m = n — /3 and is thus true for n = 2a + 1. We can 
then assume n > 2 a+ 1 , also m<n — P, and apply induction on n. 

Now as before 


n(OJlo) = n(9Jl ： i ， i) + 53n(3Ri l2 ) - P^Z 71 ^ 2 - 1 ) _ p 52 n ( OT 2 , 2 ) + … 

=p^ + P^d- 1 , 1 〆 -1 - P^d-i.iP 5-1 ~ P' pVd-uP 0-2 + … （mod ^ +1 ) 
=jP (mod p^ +1 ), 


hence the corollary. 

By the above corollary and Theorem 5, we have 

Corollary 13.3 In a group of order p n and of rank a with p ^ 3, the number 
of subgroups of order p m and of rank a-0 with 2a- 0 <n is always congruent to 
0 (mod p^). 

§4. Theorem 14 The number of subgroups of order p m of rank (p > 3, m > 
2/3 +1) in o group of order p n (n > m) of rank a {a > 0) is congruent to 0 (mod p). 

Proof For m = n, the theorem is trivial, hence we assume m < n. For 
n <£» + /? + 1 ， the theorem is true, since the group would then contain no sub¬ 
group of order p m of rank 0 (m > 20 + 1) by Theorem 5. For a = <5 + 1, we have 
n> m> 2(3 + 1 = 2a —and therefore the theorem is true by Theorem 11. Hence 
assume n > a + P + l and a > /3 + 1 and apply induction on n and 0. By the 
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enumeration principle, we have therefore 

n(®l 0 ) - ^2 n (® l i,i) + ^2 «(®li ， 2 ) (mod p); 

the by hypothesis of induction, n(SPlo) = 0 (mod p). 

Theorem 15 The number of cyclic subgroups of order p m (p > 3, m > /3) *ra a 
group of order p n (n ^ m) of rank P is congruent to 0 (mod p 0 ). 

Proof For m = n, the theorem is trival, hence we assume m <n. For n ^ a+/3, 
the theorem is true since the group would then contain no cyclic subgroups of such 
orders by Theorem 5. For a — 0, the theorem is true for /? < m < n — a (then 
n>m + o：>o + ( 5> 2a) by Theorem 3 and for m > n — q by Theorem 5. Hence 
assume n> a + 0 and a > 0 and apply induction on n and 0. By the enumeration 
principle, we have 

n(£OTo) = y^n(9Jti,i) -fy^n( 9 Jti, 2 ) -p^n(!®r 2 ,i) - Pn(OT 2 , 2 ) + … • 

By hypothesis of induction, we have then 

n(OTl 0 )=- (-l) a -^ + 1 pi (a ^ +1)(a -^ 53 n(OTl Q _^ +1 , 2 ) - (- 1 广 + 2 

X pi(a-/3+2)(o-/3+i) Z n (OT a _0 +2 ,i) + … (mod p 5 ). 

Now for 7 ^ 2a — + 1, we have m > (rank of hence by Theorems 3 and 5, 

nCWly.i) = 0 (mod p 0- ^ 1 ), «(3^, 2 ) = 0 (mod p a ~ 7 ) 

and for 7 ^ 2 a - /? + 1 , we have also 

!7(7 - 1) + a - T = ^7(7 -2,) + a> P, 
so the theorem is proved. 

Theorem 16 If a group of order p n (p > 3) contains one and only one subgroup 
of rank (3 of order p m for a fixed m, m> 2/3 + 1, then it is of rank p. 

Proof If the rank of the group is less than P, then by Theorem 5, it contains no 
subgroup of rank 0. And if the rank of the group is greater than p, then by Theorem 
14, the number of subgroups of rank 0 is divisible by p. Hence the theorem. 

Theorem 17 If a group of order p n (p > 3) contains exactly cyclic subgroups 

of order p m for a fixed m, /0 + 1 < m < n, then it is of rank 0. 

Proof Let 7 be the rank of the group considered. 





Theorems 3 and 5, the group would 
r p m , according as m ^ n — 7 (then 
n ^ to + 7 > 27 ) or m > n — 7 . Hence we get a contradiction. 

2) If 7 > P, then 7 ^ ,0+ 1. By Theorem 15, the number of cyclic subgroups of 
order p m (卢 + 1 < m < re) would be divisible by p^ +1 . Hence we get a contradiction. 
Thus the theorem is proved. 
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1 the automorphisms of the symplectic group over 
any field* 

Introduction 

Let $ be any field with characteristic ^ 2. Unless the contrary is stated, we use 
ill Latin letters to denote elements of the field and capital Latin letters to denote 
n-rowed matrices over 少 . For matrices which are not n x n, we use to denote 
an to x m matrix. / and 0 denote the ra-rowed identity and zero matrices respectively, 
and [ai, • • • , a n ] will denote a diagonal matrix with ai, ••- ,a n on its diagonal. We 
use small Greek letters for automorphisms of the field, i.e. 

(a + &)° = a a + b°, (ab) a = a a b° 

for a and b belonging to 
Let 



The group formed by the 2re-rowed matrices T satisfying 

^ = S, ( 1 ) 

where V denotes the transposed matrix of %, is called symplectic and is denoted by 
Sp($, 2n). 

Apparently, there are three different types of automorphisms X(*T) of Sp($,2n): 

1) Inner automorphisms. For a symplectic ^3, we have 

AIX) = 屯 iqr ' (2) 

2) Automophisms induced by those of the field: 

-4(X) = T 7 ; (3) 

3) Semi-inner automorphisms: We define the group formed by the matrices 屯 
satisfying 

q}SV f =aS ⑷ 


* Received November 7, 1946, Revised August 15, 1947. Annals of Mathemattics Vol. 49, No. 
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to be the extended symplectic group. Then, for 屯 satisfying (4), we have also an 
automorphism 

A(t) = <p _1 3 ： q3. 

Since 

= qj _l xq3 

and 

mmm= 祕 

we have only to consider those a which represent different cosets of the factor group of 
the multiplicative group of $ by the group $2 of square elements of More precisely, 
let 

a, 6,… 

be a representative system of - Then the elements of the extended symplectic 
group can be expressed as 



where ^3 is symplectic. 

It is the purpose of this paper to establish that the group generated by these 
three types of automorphisms is the largest group of automorphisms of Sp($ ， 2n), 
provided that the characteristic of $ is not equal to 2. 

In other words, analytically, we have the following 

Theorem All the automorphisms o/Sp($,2n) can be expressed as 

A(i) = qs 咒 rw- 1 屯 - 1 ， ⑻ 

where a runs over all automorphisms of the field, ^3 runs over all symplectic matrices 
and 

where a runs over a complete residue system of $/$ 2 - 

The corresponding problem for the special linear group was solved O. Schreier 
and B. L. van der Waerden® (see Appendix). 

① Die Automorphism der projektiven Gruppen. Hamburg. Univ. Math. Seminar., 1928, 6: 
303-322. See Appendix. 
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2. Involutions 

We now consider involutions, that is, matrices T of Sp($, 2n) such that 

T 2 = 3, ⑺ 

where 3 is the 2n-rowed identity. 

Theorem 1 Every involution in Sp($,2n) is equivalent sympletically to the 
normal form 

where J = [1， …，1， —1, …， —1] = { p , q}, say, consists of p ones and q minus ones. 
Precisely, there exists a symplectic matrix ^3 such that 

w_1 = (o °) 

The theorem is evident for n = 1; in fact, let 



(7) implies 

o 2 + 6c = 1, (a + d)b = (a + d)c = 0. 

Consequently, we have 

X=±0. 

Now we suppose that the theorem holds for symplectic matrices of order less 
than 2n. 

EVom (1) and ⑺， we have 

X5 = 矿 1 = 校 =-(Xff/, 

that is, TJ is skew symmetric, more precisely, % has the form 
T = 

where Ki and 尺 2 are skew symmetric. Since T is symplectic, we have also that 




Aft：!, K 2 A 


( 10 ) 
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e symmetric and that 

A 2 + KiK 2 = I 

There is a non-singular matrix Q such that 

，矿 0) 

. 0 0 J 


( 11 ) 


QKiQ'-- 


= 耵， 


where ki is a non-singular skew symmetric r-rowed matrix (in case r = n, there are 
no zeros in the expression). The transformat ion 


Q o 

0 Q'~ l 


)(^0(o ； -0" ( i2) 


carries Ki into K^, where 

( Q ° "I 
V 0 Q'- 1 J 

is clearly symplectic. Thus we may assume at the beginning that 


^i = 


From (10)，we have 
A 

The symplectic matrix 


( ai r) 、 

V 4 n_r,r) , 


0 0 


03 = 0 , aiki symmetric. 


carries (9) into 




A-K^S Ki 


- ( 13 ) 


fcf L ai fcf 1 a2 

, 0 , 


then the A of takes the form 



.170- 


华罗庚文集代数卷 II 


where 04 is non-singular. Now we assume at the beginning that A has this form. 
EYom (10) and (11), we deduce that 


K-x 


K 1 

0 


0 

fc 2 


Then, every involution is equivalent to 


1 0 0 A：i 0 〉 

0 a 0 0 

fcr 1 0 0 0 

0 k 2 0 a' j 

This induces two symplectic involutions 


By hypothesis of induction, the theorem is true for r ^ 0 and 一 n. A similar result 
holds if the rank of kz is neither 0 nor n. 

If the ranks of fci aad are both 0, then A 2 = /, it is known that we have Q 
such that QAQ~ r = J- The theorem is now evident, since 


(Q 0 


0 V 、 

(JO) 

\ 0 Q /_1 

l Jlo A') 

V 0 Q'~ l 1 

j) 


If r = n, then we have a suitable S such that (13) carries % into the form (9) 
with ^4 = 0. That is 、 

u. 

V if -1 0 I 


(14) 


Note that this case can only happen for even n, = 2m say. We can choose Q such 
that (12) carries (14) into the same form with 


( 0 \ 
\ - 1 ^ 0 / 


If (15) is satisfied, the symplectic matrix 
/ 0 0 
0 /( m > 



\ 0 0 


/(»») 

0 

0 

0 


0 

0 

0 

/( m ) 


(15) 


carries (14) into 



This gives 2 with zero Ki and the theorem follows from the previous case. 

Definition An involution equivalent to (8) is called an involution of signature 
(P.9>- 

3. Non equivalence of symplectic groups 

Now we consider a set of symplectic matrices (1 < i < /) satisfying 

苟 =3. (16) 

Theorem 2 We have l < 2 n and the maximum is attained. More precisely, 
any set of commutative symplectic involutions is equivalent symplectically to a subset 
of the set of matrices of the form 

(-)■ 

tuftere J is a diagonal matrix with 士 1 on its diagonal. 

The proof of the theorem is evident for n = 1 (cf. the first paragraph of the 
proof of Theorem 1). Now we apply induction on n. 

Let Ti be an involution different from ±3. By Theorem 1, we may assume that 
it has the form 

Ti = ( 二 ^ ) , J = {p, q}, p > o, g > o. 

These matrices commutative with are of the form 

/ a 产 0 6 产 0 \ 

^ _ 0 ^ 0 

= 0 0 
V 0 # o 4 9) 人 


Thus we have 


ai b\ 
Cl di 


02 h 

C 2 
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two sets of commutative involutions of Sp($’2p) and Sp($, 2q), respectively. By the 
hypothesis of induction we have Theorem 2. 

Since an isomorphic mapping carries involutions into involutions and commuta¬ 
tive involutions into commutative involutions, we have 

Theorem 3 For a fixed field 龟 ， no two Sp($, 2n) with different n are isomor¬ 
phic 

4. Generators of Sp($, 2n) 

Theorem 4 The group Sp($, 2n) is generated by the elements 



where M is a diagonal matrix with zero and one on its diagonal. 

Proof For a given symplectic matrix %, we can find a suitable ① M such that 


1 


M 

- {I-M) 



with non-singular A Then 


(“ )( 0 (“) 

where S = S'. Further, since 



where R is symmetric and 


(M 

I-M V 

1 ( 1 

f M 

i-m\_ 

.( 1 ~ R ) 

V -(,1-M) 

M ) 

U J 

^ - {I-M) 

M ) 

(o / J 


with M = 0, we have the theorem. 

① L. K. Hua. Geometries of Matrices HI. Trans. Amer. Math. Soc., 1947, 61: 229-255, 
Theorem 7. Notice that the proof holds for any field. 
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Theorem 5 The group T generated by all square elements of Sp{$,2n) is 
identical with Sp(<&, 2n). 

Proof Since the field is of characteristic ^ 2, we deduce that as S runs over 
all symmetric matrices, so does 25. IVom 

(o ： ) 2 =(oT). 


we see that T contains all elements of the form 



w 


where 屯 is any symplectic matrix and 5 is any symmetric matrix. In particular T 


contains 



0 



For non-singular S, we have 





-(/-J) 


thus, r contains elements of the form 


a 


0 

S- 1 


where S = S’. 

We shall justify later that any non-singular matrix A can be expressed as product 
of symmetric matrices, the group contains 
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symmetric matrices. 

Let Jij (* 7 ^ j) denote a matrix with 1 at (i,j) and (j, i) position and with 1 at 
(fc，fc) position for k ^ i and j and with zero elsewhere. It is known that the group of 
non-singular matrices A is generated by Jij and 



(l d 0 0 


f a 0 0 ••- N 


0 1 

0 0 … 


0 1 0 ■•• 

T{d) = 

0 0 

V： : 

10 ". 

:: / 

， H{a)= 

0 0 1 ••• 

l ； ； ： / 


Evidently Jij and H{a) are symmetric and since 




we have our assertion. 

5. Proof of the theorem 

1) The theorem is true for n = 1 ①. We suppose that the theorem holds for 
symplectic group of order less than 2 n. 

2) Since the normallsator of an involution of signature {p, 9 } is a direct product of 
two symplectic groups of matrices of orders 2 p and 2q, we see that the automorphism 
ACS：) carries an involution of signature {p, g} into an involution of signature {p, q} or 
{q,p}. By Theorem we may assume, without loss of generality, that 

J{l，n- 1 } 0 \_( 0 \ 


0 \ _ ( i 1 - 71 


0 {l,n-l }, 

{1，《-1} 0 


The aggregate of symplectic matrices commutative with 
( ±{l，n- 1 } 0 \ 


① Schreier and V. D. Waerden, ibid. Note that, in their theorem, they identified two matrices 


—T and X as a single element. But w 


d find the present conclusion easily. See Appendix. 
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consists of all elements of the form 


/ a 0 6 0 N 

无 = ° ^ j ^ ， Al = ■ A i n_1) ' Bi = B 广 ” ， etc. 

c 0 a 0 

、 0 Q 0 Dx / 


(17) 


We shall call % the direct product of two matrices 


t = t( 2 ) 


a 

c 




Bi 

A 


which are symplectic. We use the notation T = t x 1i. 

Consider the group A formed by the elements of the form 


’a 

0 

b 

0 、 

0 

±/ 

0 

0 

c 

0 

d 

0 

\o 

0 

0 

土 I j 

'a m 

0 

6* 

0 

0 

Al 

0 

B{ 

c* 

0 

d. 

0 

o 

C ： 

0 

Dl / 


where 

(A ： B\ \ 

\ C\ D\ ) 

for a group 2^. It is clear that Xj is a normal subgroup of Ti. But the factor group 
Sp($, 2n-2) over its centrum (3 (2n_2 ) ， —3 (2n)-2 ) is simple,® and then TJ is either %i 
or identity or the centrum. In case n ^ 2, the first case can never happen (Theorem 
3). The second case also cannot happen, for otherwise, the inverse of A will cease to 
carry elements of the form 

/ a 0 6 0 N 

0-/00 
c 0 d 0 

、 0 0 0 -/ / 


① Van der Waerden. Gruppen von Linearen TVansforman Theorem. Ergebnisse der Math., 
1935, 4(2): 10-11. 
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into elements of the form (17). Squaring all the elements of A, we conclude, by Theo¬ 
rem 5, that A induces an automorphism on l. Similarly, A induces an automorphism 
on Ti • In case n = 2, there is the additional possibility of interchanging both factors. 
But the inner automorphism 

'0 100、 / 0 1 0 0 \ 

1 0 0 0 ^ 10 0 0 

0 0 0 1 0 0 0 1 

\o o i o ； Vo 0 1 oj 

carries the factors into the right order. By the hypothesis of induction, we have a 
two-rowed symplectic matrix p, a matrix 

1 0 
0 oo 

and an automorphism a of 中 such that 

4(t) = prt 0 c-V 1 



and we have a (2n — 2)-rowed symplectic matrix 屯 i, a matrix 

_1 ) 0 、 




0 


w (n_1) ) 

and an automorphism 0 such that 
A(Z X ) = 


We define 


V = P x «p, 


and 


Then the automorphism 



0 

bol 


(wW-Ucr) 卿 ) f 1 


carries 1 given by (17) into 

/ a r 0 p6 T 0 \ 

0 Ai 0 B x 

(l/p)c T 0 cT 0 

、 0 Ci Q Di } 


Let A be the sub-group formed by matrices of the form 


(I [ai, •■- ,a„] \ 

V° 1 )' 

le equation (18) asserts that A induces a mapping carryi 
ider all the symplectic matrices commutative with every 
s form a group 2^ The elements of S 2 are of the form 





immediately 


[ai, … ,a„]C = 0, 

.Then D = A'~ l and B = AS. (19) implies also 








wnere i ana 1 are symmeinc. J.ne group tormea oy elements oi x.ne lorra 

I 0 
T I 

is denoted by Sq. 

Consider those elements carrying the groups S and Eq onto themselves, 
form a group with elements of the form 

f A 0 ) 

卜义 - 1 人 

Therefore we have 

A 0 \ _ f A" 0 \ 

0 A'- 1 J L 0 A m '~ l ) ' 
that is A induces an autormorphism of the general linear group (A *-* A*). 
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4) In (18)，we take 


A\ = al, b = c = 0, B\ = 0, Ci = 0, 

then we have 

. (al^ 0 \ = / [a T ,a,- - ,a] 0 \ 

\ 0 o -1 / (n) J \ 0 {o T , a, • • • . a] -1 / 

If the field contains more than three elements, we can find an element a 一 ±1. 
If a T ^ a and a -1 , the normalizer of 

(al 0 、 , ( [<» T ， a ， … ， a 】 0 \ 

V 0 J 〜 （ 0 [o T ,o, ••- .a]" 1 ) 

cannot be isomorphic. This is absurd. Therefore a T = a or a -1 for all a belonging to 
If there is an element a in $ such that 

a T = a -1 , 


then, if $ is of characteristic ^ 3, we have 


2a _1 = a -1 +o -1 = a T +a T = (2a) T 


(2a)' 
or 2a. 


Both are impossible for a 〆 ±1. If $ is of characteristic 3, we have 


1 + a -1 = 1 + a T = (1 + o) T 



l + o, 

or (1 + a) -1 . 


Both are also impossible for a ^ ±1. Therefore for a ^ ±1, we have 


o T = a. 

This holds also for a = ±1. Therefore we have r = 1, the identity automorphism. 
The field having only three elements is the field ： 


— 1,0,1 (mod 3). 


The only automorphism of the field is the identity automorphism. We have also r = 1. 
Therefore we have r = 1 in (18). 
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5) Another particular case of (18) is 


4( / - ,6 n ] ) = ( J ㈣ ，…， 

Combining (23) and (24), we have 

{ A)(o I6l '；'"')(o ；-) 

I ,b n ]A' 1 

0 I 

This asserts that (20) induces a mapping carrying the set of symmetric matrices onto 
itself and the ranks of S and 5* are equal. 

Combining (20) and (24), we have 





A 


I 

0 


s -[ 6i r w H( 

= / / 5*-[p6,,62,- -，M \ 



Consequently, if we have a set of values 6 i, • •• ,b n such that 
det(S- [ 61 ， … ,b n ]) =0, 


then it satisfies also 

det(S*-[p6i,b2, ■ ， M) = 0 


and vice versa. 

Since these equations are linear in each of the variables bi, ’ b n , and the char¬ 

acteristic of $ is not equal to 2 ①， the coefficients are proportional, that is, if we put 
S = {Sij) and S* = («*_,), 


i _ £ii _ 

P _ ~ P s u 

_ SnSjj — 


for 1 < t 
for 1 < j 


① Notice that by eliminating bi, we have an equation which is quadratic in ea«h of 62 , ••- ,b n 
and equal to zero for all 62 ,... , *>n. Thus the coefficients of the equation are identical zero. Conse¬ 
quently we have the assertion. 
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_ saajj - s? 

' P( s ii 8 jj - s i, ? ) 



Sll 

Sl2 

«13 ' 

/ 


s 12 

s 13 

= 

Sl2 

»22 

S 23 ! 

/ 

s 12 

s 22 

s 23 


«13 

S23 

S33 

/ 

1 ®13 

*23 

s 33 

a ll = 

sup, 

S:i: 

= 8ii, 

1 <i, 




s ij = a ij » 1 < ‘ < J •’ 

p(anSjj - 8i f ) = - = panSjj - S*J. 

Then we have 

P«ii = a ii- 

This is only possible when p is a square element. Then, we have 


(25) 


Consequently, we have 

an = 

Another special case of (18) is 

.(： 


a ii> a ij = ^ a ij- 


1 0 
0 B 
I 


1 0 

I 


(26) 


Using (26) instead of (24), we have, by the same argument, that 
= sjj for 1 < » < j. 

Let Gij (i ^ j) denote a matrix with 1 at (i, j) and (j, t)-position with 0 elsewhere. 
We have already proved that 


G ij = G *i for 1 < i < j, 

G；i = Gu. 

We have a matrix A = [—1,1, ••- , 1] such that 

AG\2-A! — — Gi2_ 

Thus without loss of generality, we may assume that A satisfies (18) with p = 1 and 
r = 1 and GJ 2 = Gu 
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If Gj 3 = -G 13 , by (25), we have 


0 


= 4, 


this is impossible. Therefore 


= G u . 

Since 

[a, 1 ， … ， l]Gi 2 [a, 1 , … ， 1 ]’ = aGi 2 , 

the additive property of 

S' = S 

the module formed by symmetric matrices implies for all symmetric S. Therefore 


0 = (； 


(27) 


6 ) Combining (23) and (27), we have 

ASA' = A*SA m, 


for all symmetric matrices S. Then we have 

A m = ±A. 


Those A satisfying A* = A forma subgroup T. The index of T in the full linear group 
is either 1 or 2. Now we are going to prove that it cannot be two, therefore T is the 
full linear group. By (18), all these A of the form 


(: 


4 二) 


(28) 


belong to T. For n ^ 3 the full linear group is generated by the elements (28) and 
J 12 , where Jij denotes a matrix with 1 at (i,j) and (j, i) position and with 1 at (A:, k) 
position for k ^ i,k ^ j and with 0 elsewhere. If the index is two then 


By (28), we have 


J12 ~ 

Jij = J%j for i > l,j > 1 . 


(29) 



Then, from 


we deduce that 


The last equation cont 


In case n = 2, by ( 


belong to r. Further w< 


Squaring both sides, w< 
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Evidently the full linear group is generated by elements of the form 



The assertion is also true for n = 2. 

7) FVom (18), we have 

\ -(I-M) M ) ~ \ -(I-M) M )' 
Combining (27), (30) and (31) and using Theorem 4 we have the theorem. 


Appendix: automorphisms of unimodular group 


Since the proof of O. Schreier and B. v. d. Waerden contains an unjustifiable 
point®, for completeness we include the following appendix. 

In the appendix, we do not assume that the characteristic p of the field is not 
equal to 2 and we only consider two-rowed matrices. The theorem to be proved is the 
following. 

Theorem Every automorphism, of the unimodular group of order two is of the 


form 



where P non-singular and o is an automorphism of the field. 

Definition For p 一 0, a unimodular matrix A is called parabolic, if A 2p = I. 
For p — 0, a unimodular matrix A is called parabolic, if there are infinitely many 
unimodular matrices similar to A and commutative with A. 

Lemma 1 A necessary and sufficient condition for a matrix A to be parabolic 
is that the characteristic equation has a double root ±1, and consequently, every 
parabolic element is similar to 



a 


① Loc. cit, p. 313. Let 5 be a linear fractional transformation on Let Ws be the normalizer 
of S and be the normalizer of They assert that 


2, if S is elliptic and hyperbolic. 


However this is not true for S being elliptic. Since the fixed elements of an elliptic transformation 
may not be in the field, sometimes there does not exist a transformation in $ to permit them. 
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under the unimodidar group. 

Proof 1) p 5 ^ 0. In an extended field, the characteristic roots of A are a and 
1/a. Prom A 2p = I, we deduce 

a 2p = 1. 

Consequently a 2 = 1 and a = a -1 = ±1. 

2) p = 0. Suppose ct _ ±1, we have in an extended field a matrix P such that 


If B is commutative with A, we have 
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We can always find a unique t in the field such that 
—ct 2 + (d — a)t + 6 = 0. 


Then 



carries the second matrix into our required form. 


ven automorphism to an inner automorphism we have 如， 《 S ， *o, *5, none of them 
zero and a choice of ± such that 




!, for A carries the group Ti 



rries the group r，formed by matrices 


,the statement is also true. Similarly 




Consider those elements which carry I 


form 



and constitute a group S. 

Farther consider those elements which 


the form 


Since 


0 

-d - 1 



keeps ri,r 2 and E unaltered and 


P 
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Since 


0 V 

=( 1 - x \ 

( i 



1 ( 1 0 ) 

\0 ) 

V« i ) 

U _i - 

1 ij 

(o ij 

v -a-^) i ； 


we have 

(P( x ) 。 \ = /1 ( 1 oWl l\ / 1 0、 

V 0 咖广 1 广 I 。 1 八《7(；|：-1)-1 1 八 0 1 八-1 + 咖 lj 
= (* 

~ V * oi . x - 1 ) J . 

We deduce 

1 一 o -(*)« t ( x _1 ) = 0, p{x)~ l = <r(x~ l ), 


that is, for a: ^ 0, 


o[x) = p(x). 


This asserts that (t(x) is an autmorphism of the field. 



On the existence of solutions of cert at in equations 
in a finite field* 

In another paper ① one of the authors stated that he had arrived at limits, both 
inferior and superior, for the number of solutions of the equation 

Cl®? 1 + C2®2* + . • • + c a x %' + c s +i = 0 (1) 

in the x’s where a’s are integers such that 0 < a < p n — 1; s > 2 for ^ 0 and 
s > 2 for c s +i = 0, the c’s being given elements of a finite field of order p n ,p prime, 
which will be designated by F(j) n ); and 

Ci … c,ii ■ • ■ / 0. (2) 

As a consequence of this result, one can obtain 

Theorem I The equation (1) with the restriction (2) always has at least k 
solutions in the x's for k any given positive integer provided. p n exceeds a certain 
limit. 

In this paper we shall give two quite different approaches to establish this the¬ 
orem. The first is closely related to the one previously mentioned and the other 
argument, although subject to the limitation s > 2 when c s +\ ^ 0, is far simpler and 
is based on a method ② which was introduced by one of the authors in the study of 
generalized Gaussian sums over a finite field. 

The limit given here can be sharpened, and the proof of this will be published 
later. 

Elsewhere® it was shown that the exact number of solutions of (1) may be 

* Reprinted from Proceedings of the National Academy of Sciences, 1948, 34(6): 258-263. By 
L. K. Hua and H. S. Vandiver. 

① These Proceedings, 1947, 33 ： 236-242. In this paper reference to previous results was made, 
but an important paper by Mordell. Maihematische Zeitschrift, 1933, 37: 207, which bore more 
directly on the contents, was, unfortunately, not mentioned. Other relevant references are Pellet, 
Bvll. Math. Soc. France, 1886, 15, 80-93; Dickson, Crelle, 1909, 135: 181-188; Hurwitz, Crelle, 
1909, 136: 272-292; Mitchell, Ann Math., II, 1917, 18: 120; Davenport, Jour. London Math. Soc., 
1931, 6: 49-54; Schur, I., Jahresber. Deutsch. Math. Verein., 1916, 25: 114. 

② Cf. a paper by Hua, Loo-Keng. "On a double exponential sum”，soon to appear in the 
Science Reports of Tsing Hua University. An abstract is given in the Science Record of the Acad. 
Sinica, 1, Nos. 1-2. 

③ These Proceedings, 1946, 32: 47-52. 
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determined directly if we know the exact number of solutions of of the 

equation 

0 i+ma + # +m0 + 1 = 0 (3) 

for each i and j; 9 being a primitive root of F[p n )\ (p n — l,aj) = i = 1,2, - • • ,s; 
m is the L.C.M of d\,d 2 ,-" ,d s . Further in this argument some or all of the a's may 
equal p n — 1. To prove this we employed formulae (5), (12) and (15) of the paper just 
mentioned and all of these contain positive terms only. 

We may adapt this method, however, to finding limits from the number of solu¬ 
tions of (1) by first finding limits for the number of solutions of (3) and then employing 
(5), (12) and (15) of the previous paper®. 

In the latter the formulae 

Aoo = (c - l) 2 + c(m - 1); Am = Mk = c 2 - c, 

Ahk =c 2 , h^k\ Ahh = c 2 - c, 

(mod m), fc _0 (mod m), p n =l+mc (4) 


are derived; where 


Ahk = + J + k ] 


and [t, j] is the number of different pairs r and t in the set 0,1, ••- ,c— 1 which satisfy 
the equation 

1 + 9 i+rm = e j+itn , p n -l = cm. (5) 

If {i,j} denotes the number of solutions of (3) then we note that 

[*»j] = + ⑹ 

where e = ind(—1), with x = 0 indl . We have 
0 to m—1 0 to m - 1 

E (m 2 [i,i]-p n ) a = X) (m 4 M 3 - 2 mVM)+p 2n m 2 , 

which gives, after employing (4) on the right as well as the known relations 
{t,j] = [?' + e, i + e] = \-j + e, t - j] = [i - j + e,-j) 

— [—*>i — i] = b — ^+ £ > ~+ e ' — ⑺ 




① These Proceedings, 1946, 32: 47-52. 
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where 

D = — 3m + 2) + 3m + 1. 

Here d is the number of different pairs in (7) in the sense that [/», fc] is different from 
if h ^ hi, or k^k ki (mod m). 

The use of (8) with (6) of this paper and (5), (12) and (15) of the former paper 
yield superior and inferior limits for the number of solutions of (1), and we obtain a 
proof of Theorem I. 

For the second proof of Theorem I (except for s = 2 with c*+i ^ 0), we proceed 
as follows: 

Any element a of F(p n ) may be written uniquely in the form, if d is a primitive 
root in F(p n ), 

a = do + <h0+-+ dn-xe n ~ l = h{0), (9) 

where the d's are in the included field F(p). Now 0 satisfies an irreducible equation 
f(x) = 0 of degree n with coefficients in F{p) and whose roots are … ,0 n , with 
9i = 9. Set (here tr is short for trace) 

h(9i) = tr h{9). 

«=i 


We shall now prove 

Lemma 1 If C,= e 2i " p , then, for n<p, 


E c tr(at, = 

a6F(p n ) 


o, 

.P n - 


for tj^O, 
for t = 0. 


( 10 ) 


This statement is obvious for f = 0. For t ^ 0, at ranges over all the elements of 
F(p n ). Hence it is sufficient to prove that 

E c tr(Qt) = E … E r = n (E 产…)) = 。， (ii) 

aeK do—0 d„-i=0 i=0 \di=0 ) 


where 

T = tr(do d\€ + ■ ■ • + d n —iO n J ). 


There is one of the i，s such that p does not divide tr ( 炉 ) .For, if the contrary is true 
then tr(0*) = 0 for all 0 < i < n - 1. Since 9\=0 satisfies (x) =0 of degree n as do 
also 办， … ,O n , then Newton's relations between the elementary symmetric functions 
of these roots and the sum of the nth powers of said roots give easily ff n = c, with c 
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in F(p). This yields = 1， contradicting the definition of ^ as a generator of 

F(p n ) for n > 1, since p n — 1 = (p - l)(p n_1 + • • ■ + 1) > n{p — 1). This proves (10). 
We now prove 

Lemma 2 Write K for F(p n ). If 

州， a,C) = E C tr(<BO) , 



lV»(<,a,C)l < (P n - l 1 o)p"/ 2 . (12) 

Now 

^ J>(* ， a 乂 ) 轉，以 _1 )) = E E c tr(ti °- tv0) - (i3) 

P teK t€K *.W6K 

Using Lemma 1, the right-hand member reduces to (p n — l)(p n — l,o), since this is 
the number of different solutions in x and y ^ 0 of 

x a = y a 

in F(p n ). Since |V>(t,a,C)| = then (12) gives, if we set a, C)= 少 (* ， a), 

-4^l^(«,a)| 2 = (p n ~ l)(p n -l,a). (14) 

P t€K 

Since, if t — 0, the number of solutions of ty a = q is ^ (p n — 1, a), then 
(p n - i)|v»(t,a)l 2 = 5ZI 州 y' a )l 2 

\% K o 

^ (p n -l,o) ^2 \ip(q,a)\ 2 
q€K 

<(p n -l,a)V(p n -l), 

and by (14), we then have Lemma 2. Then for the proof of Theorem I, we have 

Af = 4：E ai ) …柳山 a 8 )C tr(c * +lt) 

P teK 

= ㈣ 二 1 ” + 4S ••^(c»t,a.)C tr(c * +lt) , (15) 

P P t€K 

«9<0 


where N is the number of solutions of (1), and now we count all the solutions under 
the restriction (2) (in connection with (5) we did not do this exactly, that is, all the 



solutions of (5) would be Then 


卜 _ ^ & 1 ) 卜長 E W "( c “， fl i)l … k(c a t,a»)|, 

«|io 

and by Lemma 2, we have 

whence 

W _ 1): - Ap na,2 < W < ^ n ~ — + Ap n " 2 , 

p n V 

A = ^"-1,0*). (16) 

i-l 

The quantity A > 0 never exceeds the fixed value JJoj. Set p n — 1 = h. Then the 

<=i 

left-hand member of (16) becomes, if r = (^ + 2)/2, 



which for s > 2 and p n sufficiently large is obviously > k for k any integer. This 
proves Theorem I, for s > 2. Fh)m (16) we may also write 

N= +0(p n,/2 )- (18) 

We may now apply Theorem I to congruences with respect to an ideal prime 
modulus in any commutative ring R with a unity element. We may consider the 
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Theorem II The congruence (19) always has at least k solutions in the x's 
for k any positive integer provided 


«iQ 2 - - - a s xia ：2 • • • i, # 0 (mod p), (20) 

s ^ 2 for c a+ i 笋 0 (mod p) and s > 2 for c a+ i = 0 (mod p); p is a prime ideal of 
finite norm in a commutative ring R with a unity element, the a’a are fixed positive 

integers, the a’8 are fixed elements in R, and the norm ofpis sufficiently large. 

If i? is the ring of integers in an algebraic field then every ideal has a finite norm 

so that Theorem II holds for any algebraic ring of this type with p any prime ideal 

in the ring. If i2 is the ring of rational integers, then n = 1 and R is the system of 
residue classes modulo p, and we have in particular the 

Corollary If in (19) the x's are rational integers and p = (p) with p a prime 
rational integer then this congruence always has k solutions for p sufficiently large 
with the other conditions in Theorem II holding, k any integer. 

This corollary was given by Mordell when a a +i # 0, and we include aW ① solu¬ 
tions of (19). We considered only solutions prime to p (primitive solutions) in our 
work, following the conditions (2) and (20). 


① Mordell's method of proof is different from either of those used in the present paper. 

In some ways there is quite a distinction between finding the primitive solutions of and equation 
in a finite field and findiog all solutions. The congruence 

if + a:* + •••+*»= 0 (mod 2) 

has no solutions in integers prime to 2, if a is odd, but evidently has solutions for some of the x's 
even. Again the congruence + + 1 = 0 (mod 491) has no solutions in integers x and y prime 

to 491. But the congruence x 7 + 1 = 0 (mod 491) obviously has solutions. 














Characters over certain types of rings with 
applications to the theory of equations in a finite 
field* 

Characters, modulo m, have been defined (for example by Landau ①） and have 
been employed extensively in Group Theory and Number Theory, particularly in 
some of the most important parts of Analytic Number Theory. However, we may 
define them by means of any ring with a unity element and which contains only 
a finite number of elements which are not zero divisors, instead of the special ring 
consisting of the residue classes modulo m, and a number of extensions of known 
results concerning said special case are immediate. We will set up this concept and 
then specialize it for the finite field using results which are then applied to the theory 
of certain equations in a finite field which were considered in another paper ② by the 
authors. This type of equation is 

+ c 2 X 2 a + ••• + c,xj* + c,+i = 0, (1) 

where the a 5 s are integers such that 0 < a < p n — 1; « ^ 2 for c 4 +i ^ 0 and s > 2 for 
Cg+i = 0, the c’s being given elements of a finite field of order p n , p prime, which will 
be designated by F(p n ) and 

Ci - - - c t xi •••x B #0. (2) 

In the present paper we obtain a new explicit expression for the number of solutions 
of (1) in terms of generalized Lagrange resolvents and (see the r number defined in 
Theorem I) and using this we find better limits for the number of solutions of (1) 
than those derived in E. 

# Reprinted from Proceedings of the National Academy of Sciences, 1949, 35(2): 94—99. By L. 
K. Hua and H.S. Vandiver. 

① Vorlesungen iiber Zahlentheorie. Leipzig, Hirael, 1927, Bd. I, 83-87- Cf. also Dickson. 
Modem Elementary Theory of Numbers. University of Chicago Press, 1939: 272-276. 

② These Proceedings, 1948, 34: 258-263. This article will be referred to as E. Characters 
defined by a general finite field have been considered by a number of writers. Cf. the paper referred 
to in our footnote (D in page 190 of this book and the references given therein as well as Davenport. 
Acta Mathematica, 1939, 71 : 9&-121. 






finite number of elements which are not divisors of zero. A set of such elements forms 
a group. For, since it does not contain zero or any divisors of zero, the cancellation 
holds and a finite semi-group having this property is known to be a group. Or, 
otherwise expressed, these elements are units in R. 

Let an arithmetical function x(a) be defined for any element a of R bo that 

I. x(a) is a complex number. 

II. X(l) / 0. 

III. x ⑷ = 0 when a is zero or any divisor of zero. 

IV. If ai = a 2 in R, then x(<*i) = X(a 2 ). 

V. x(aia 2 ) = x(oi)x(aa)- 

The arguments employed by Landau for the special ring formed by the residue classes 
modulo m by which be obtains his Satz 127 to Satz 133 inclusive may be easily 
extended to yield the following: 

For each character \ we have 

X ⑴ =1. (3) 

There is a character designated by xo such that 

Xo(<0 = 1， （3a) 

for each a in R. Let <f>(R) — h be the number of distinct units in R. We shall call h 
the indicator^ of R. Then since the units u forma group, u h = 1 in R, and by I and 

‘ X(a) = e^ h , ⑷ 


for 0 < h. 

We also obtain 


E^°) = { 

a€R V. 


/»， for X = Xo, 
0， for X ^ Xo- 


Suppose that we have, if/i = 0 (mod k), a character such that (x(a))* = 1 for 
each unit a in R, this will be called a special character and designated by Xfe- I n 
particular Xo j 8 a special character for any fc. 

① This twm is employed for the reason that the special R consisting of the residue classes 
modulo m, our units correspond to the integers less than m and prime to it. 
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Now let ilbea finite field of order p n , p prime, and designate the same by F(p n ). 
Let 0 be a primitive root of this field, and let y9 be a primitive (p n — l)th root of unity. 
Then by (4) 

0 = 0 b , 0<b<p n -l, 

and since all the elements, # 0, of F(p n ) are given by powers of 0 then the whole 
character, by V, is given by powers of 0 b . Now consider Xk- Then 

xk(e) = ^ ⑹ 


and by definition of X*»(X*W) fc = ^ dk — 1 which gives d = 0 (mod( p _1 )/fc). It 
follows from this that there are exactly k special characters, and in particular there 
axe h distinct characters defined by R. We now have 


Lemma II 



for a a fcth power, 
otherwise. 


This follows easily from (6) and the remarks following it. 
We also have ① 

Lemma III ；/C = e 加 ' then in F(p n ), 


⑺ 


{ 0, for t ^ 0, 

念 1 p", for« = 0. 


⑻ 


We now proceed to® 

Theorem I The number of solutions N of (1) with the restriction (2) is 

N _iP n -iy , t 

p n p n， 


where s 

nE^( a " ic . ri Mxib.)c tr(ac * +,) , 

a « e 4 <=i ^ 

① In this paper we are using the symbols 1 and 0 both for the unity and zero elements in the 
complex field and the unity and zero elements in R. This should not cause confusion if we keep in 
mind that x(a) is always in the complex field while a is always in R. 

© This was proved for n < p in E, and the argument employed there may be extended to 
the case n 彡 p. However, the result was previously known in general, in fact it follows from the 
results in Stickelberger. Math. Annalen, 1890, 37: 321, §1, paragraph 5, since any finite field may be 

represented Iqf a complete set of residue classes with respect to a prime ideal modulus in an algebraic 










198 - 


华罗庚文集代数卷 II 


b6K 

and fcj = (Oj ， p n — 1). Also K stands for F{p n ). 

To prove this we note that 

y aeK i=l 

*H«o 

G(x) = C 1 X 2 1 + c 2 x§ 2 + •■• + c a x^ + c,+i. ⑼ 

First, it is known that the number of solutions of (1) and the number of solutions 
of G(x) = 0 are the same if (p n — l,Oj) = ki. Also when we carry out the summation 
in (9a) for the Xi’s all the terms are unity when we have a set of ®，s which satisfy 
G(a) = 0 with the restriction (2) and when we sum with respect to a we obtain p n 
according to Lemma III, but when we have a set of art's such that (1) is not satisfied 
then by the same lemma the sum with respect to a gives zero (the relation (15) in E, 
p. 261， is a transformation of (9)). In the right-hand member, if we separate the zero 
and non-zero values of a, we obtain from (9), 


N 




( 10 ) 


y o€K t=l 
»5«0 


Now employing Lemma II, 


^ C tr(cx-) = ^^ X/ , (c -l J/)c tr(„) 
V€K Xk 


and then (10) gives 


r =e ni >‘( c 「 ia_i ) T (办 ‘K tr(oc * +o . 
>=1 Xk* 


⑽ 


We now employ (10) and (11) to find limits for N. We see from ⑷ that for a 
fixed a 

i 

»i(o) = X«(a)i ( lla ) 


«=i 
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for some v, Also it is konwn® that 

|t(x)I = P n/2 . X^Xo, 

k(Xo)| = 1 - ( 11 b) 

If we write by (11a), 

nXfci( a-1 ) = X(a)» 

t=i 

E = X(c；^) E x(a)C tr(o) = XicJ^rix), c. +1 # 0 , 

aeK a€K 

and employ (11) and (lib) noting that there are k characters Xfc, and that x is a xie, 

«=1 Xki 

|r|^n(i + (fci-i)p n/ 2 )p n/2 . ( 12 ) 

i=l 

Let us now suppose that c,+i = 0. Then we have = 1 in (11) and 

^2x(c ： 1 a- 1 )^p n -l 

<*6K 

and then 

|r| < (p n _ l) 亡 (l + ㈨ -i)p n/2 ). (i 3 ) 

«=i 

Hence (10) gives for c a+ i ^ 0, 

\ n - U f[(l + (fci - l)p"/ 2 )p -"/ 2 ( 14 ) 

I p ' i=i 

and for c s +! = 0, 

卜 ^卜 ^^nd+w- 1 〆 2 ). ( 15 ) 

Whence we have 

① Davenport and Hasse. J. f. Math. {Crelle), 1935, 1T2: 151-174 found, for the case a = 2 
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Theorem II If N is the number of solutions of (1) under the condition (2), 
then for c a+1 ^ 0, (p n - 1 , 0 ^) = ki, 

p n p n 

and for c B +i = 0, 

^r^ Dl ^ N ^^y +Dl 

p n p n 

where 

z 3 =p-"/ 2 n(i+(fc j -i)p"/ 2 ) 

i=l 

and 

Di = ^^fl(l + (ki-l)p n/2 ). 

P i=l 

R-om the above theorem we infer immediately that 

where v = 1 or 0 according as c a+1 ^ 0 or c*+i = 0. 

If we consider (1) without the restriction xi,X 2 ,- ■ • ,x a ^ 0, we may find limits 
for the number of solutions by considering one or more of the c’s in (1) as being zero 
in turn and employ for each of the resulting equations the limits (16) and (17) and 
adding the results we obtain inferior and superior limits of somewhat the same type 
as (16) and (17) and in particular we infer that if Ni is said number then 

Ni= p—- 1 ) + 0 (p"(*-«)/ a ), (19) 

v having the same meaning as in (18). 


(16) 

(17) 



On the automorphisms of a sfield* 


Let if be a sfield. A mapping (a —» a* 7 ) of K onto itself is called a semiautomor¬ 
phism if it satisfies 

(a + 6) ff = a ff + 6 <7 , (1) 

(aba)° = a a b°a a (2) 

and 

l a = l. ⑶ 

The well-known examples of semi-automorphisms are automorphisms, which sat¬ 
isfy [ab)° = a a b a , and anti-automorphisms, which satisfy (ab)° = b a a°. 

It is a known problem about the existence of semi-automrphism other than auto¬ 
morphisms and anti-automorphisms. It is the aim of the note to settle this problem, 
namely: 

Theorem 1 Every semi-automorphism is either an automorphism or an anti¬ 
automorphism. 

To prove the theorem we need several simple consequences of (1), (2) and (3). 

Putting b = a -1 in (2), we have 

= (一 )- 1 ⑷ 

and replacing a by o + 6 and b by 1 in (2), we have, by (3), 

{ab) a + (6o) ff = m b°a°. (5) 

Applying (2) twice, we obtain, by (4), 

(ba) a = (a6(oft)- 1 6a) cr = a a 6 a (o6) <T - 1 6 <, a <, . (6) 

Substituting (6) into (5), we have 

(ofc) ff + = a a b ff + b°a°, 

* Reprinted from Proceedings oj the National Academy of Sciences, 1949, 35(7): 386-389. 
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which is equivalent to 


((oft)* 7 - a a b a )(l - (aby-^aT) = 0. 


We deduce immediately 


w a - 


a°b°, 

b°a°. 


Suppose that we have a pair of elements a and b such that 
(ab) a = b a a° ^ a a b°. 

Then, for any c, we have 

(ac)^ = <fa a . 

In fact, otherwise, we would have, by (8), 

(oc) ff = a°(f ^ (fa°, 

and, by (9) and (1) 


o^c 0 ■ + b a a a = (ac) <r + (a6 广 =(a(6 + c)) 17 
Both conclusions are impossible. Similarly, we prove that, for any d, 


a a {b° + <f), or 
(b a + <f)a a . 


(db) ° = b a dT. 

Now we are going to prove that 

(dc) a = c a dT. 

Suppose the contrary, that is, by (8), 

{dcY = (Tc 17 (一 c a dr). 

Similar to the previous argument, we can establish that 
« = a a c a 

and 

(db) a = dTb°. 


⑺ 


⑻ 


⑼ 


( 10 ) 


( 11 ) 


( 12 ) 


(13) 

(14) 
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Now we consider the elements: 


b°a。+ (ac)° + {db) a + d^c° = ((a + d)(64 - c)) a 

f (o* 7 + + (f), or 

\ + ^)(0^ + d a ). 

The first conclusion contradicts (9), by (10) and (11), and the second conclusion 
contradicts (12), by (13) and (14). 

Therefore, if there is a pair of elements a, b such that (ab)° = b°a a {^ o. a b°), then, 
for any pair of elements c and d, we have (cd)° = d a <f. This proves our theorem. 

By means of the previous theorem, we also settle a problem in the study of 
projective geometry on a line over a sfield. Namely: 

Theorem 2 Any one to one mapping carrying the projective line over a sfield 
of characteristic ^ 2 onto itself and keeping harmonic relation invariant is a semi- 
linear transformation induced by an automorphism or an anti-automorphism. 

This theorem was established for the quaternion algebra (Ancochea®) and later 
for division algebra (Ancochea ②)， characteristic 〆 2 and he left the general problem 
open. 

As an easy consequence of the following theorem on geometry of matrices (the 
proof will be given elsewhere), we can extend Theorem 1 to any semisimple ring with 
a descending chain condition. 

Theorem 3 Two nxm matrices Z and W are said to coherent, if the rank of 
their difference Z-W is one. Suppose 1 < n < m. Any one to one mapping carrying 
nxm matrices into nxm matrices and leaving the coherence relation invariant ia of 
the form 

Zi = PZ°Q + R, (15) 

where P (= P^) and Q (= Q( m )) are non-singular and R = /2 ( n,tn )，and a is an 
automorphism of the sfield. In case n = m, in addition to (15), we have also 

Zi = PZ'°Q + R, (16) 

where a is an anti-automorphism of the sfield and Z' is the transposed matrix of Z. 


① Ancochea G. J. Math., 1942, 184: 192-198. 

② Ancochea G. Annals Math., 1947, 48: 147-153. 
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By entirely different methods, Ancochea and Kaplansky ① treated the problem 
of semi-automorphisms under some restrictions. The former established the corre¬ 
sponding theorem for a simple algebra over a field of characteristic ^ 2, and the later 
extended it to semi-simple algebra over any field. Both of their methods rooted in 
the structure theory of linear algebras, therefore neither of them can be extended to 
the general case. 


① Kaplansky I. Duke Math. J., 1947, 14: 521-525. 



On the number of solutions of some trinomial 
equations in a finite field* 


In the present paper we shall use the results in two previous papers ① by the 
authors to obtain (Theorem I) the exact number, explicitly in terms of p, n, ki and 
A ： 2 , of solutions of the equation, if 0 < a< < p n — 1, (p n — l,Oj) = ki, i = 1,2, 

cix? 1 + C 2 l 5 S + c 3 = 0, (1) 

in xi and X2, non-zero elements of a finite field F(p n ) of order p n , p an odd prime 
with ci,C 2 ,C 3 given elements of f'(p n ),ciC 2 C 3 ^ 0, in certain special cases. Secondly, 
we shall find limits (Theorem II) for the number of solutions of (1) which are better 
than those given in C for the solutions of this particular equation, and which have 
the unusual property that they agree with the exact values found for the number of 
solutions in the special cases treated in Theorem I. We shall employ the notation used 
in H (defined in (9b) just below relation (14) of that paper). 

Let a be a primitive (dsiS2^-th root of unity, m = dsiS2 where da\ = A ： i, ds^ = 
where (fci,fc 2 ) = d, whence («i,S 2 ) = 1- Let \k, be a special ^-character and \k 3 a 
special ^-character in F{p n ) and write, if K stands for F(p n ), 


^ yi** i»d<a)ctr ⑷ 


and 

T (Xfca) = ind ⑷ C tr(0) , 

aeK 

where tr(a) is the trace of a in F(p n ) with respect to F(p), and 穸 1 « is defined by 

fl ind(a) = 


* Reprinted from Proceedings of the National Academy of Sciences, 1949, 35(8): 477—481. By 
Loo-Keng Kua and Vandiver H S. 

① These Proceedings, 1949, 35: 94-99, this will be referred to as C; 1949, 35: 451-457, this 
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in F{p n ),g being a primitive root of F(p n ) and we define ind (0) as zero. Hence, as 
is known, provided Xfc'Xfej 5^ 

Q(xD p 

ind(a)+/ia»i ind(l—o) 

Also, setting (—ai) for (a) we obtain 




82 ind(a)+ 卩 2 ^i ind(l—o) 


E 


^ 1*2 ind(—ai)+/J 2 «i ind(l+ai) 
a j»is a ind(— 1 ) t ind(ai)+M 2 Si ind(l+oi)_ 


o,e/C 


Let n = 2ni, then p 2 " 1 -1 = 0 (mod 4) and if (/iiS 2 (p 2ni - 1))/2 = e so that 

a# i,« a ind(-l) _ (_1)« = 1 ， 


rWrW 
T(XM ' 


5 > Miaa 

aeK 


= 妒 (a). 


( 2 ) 


Now Mitchell ① showed that if H\S 2 ^ 0 (mod m), ^ 0 (mod m) and /XS 2 + //si 

0 (mod m) then 

分 ⑷ = (-l 广 V 11 ， (3) 


where p belongs to the exponent 2t, modulo m and n\ = rt with p t = —1 (mod m). 
We now employ (18) of H, which may be written as follows: 


c 3 、 r(xg ； )r(^) 




⑷ 


where in the last term xg 〆 1， xg ^ 1, 〆 1， divide the discussion 

into seven cases in order to express s in terms of integers only. Because of (3) and 
^(a) = -1 for Hi = 0, /12 ^ 0, also for m 笋 0, 的 = 0, this is always possible. 

The cases are: 

I. C 3 = c 2 = Cl. II. c 3 = Cl,C2 ^ Cl with ind c 2 = ind ci (mod d). III. c 3 = ci, 
ind ca _ ind ci (mod d). IV. C3 = C2,ci ^ Ca, ind C2 = ind Ci (mod d). V. C3 = C2, 


① Ann. Math, II, 1917, 18: 120. 
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C 2 _ Ci (mod d). VI. c 3 ^ C 2 , C 3 ^ cj, ind ca = ind ci (mod d). VII. c 3 _ ci,c 3 ^ C 2 , 
Ci 笋 C 2 (mod d). Write 

<5) 

wherex^^l.X^Vl.xK^ 1 - 

In case I, we find 

go ， 

IX-s)=- 

☆r( 

To reduce A in this case, we take all the terms in which ^ 1, Xk\ ^ ^ and then 
subtract all the terms from the result in which = 1 with x *, 1 # l,Xfc* # 1 and 

employ (3). The terms of the first type give 

(幻 -1)( 如 —1) 广 (-1 广 1 • 

As in the proof of (18) of s there are (d — 1) terms of the second type just mentioned. 
Hence (4) reduces to 

p" + 1 - 幻 _ fca - d + (( 幻一 l)(fe, - 1 ) - (d - 1 ))(- 1 )卜 V 1 • ( 6 ) 

For case II, we find the same value for the first summation in (4) but for the 
second, we have the value 0 since C 3 ^ C 2 . In reducing A for this case and separating 
the summation in the same way it was separated in case I, we have for the first part 
(k\ — l)p n, (—l) r and for the second part (d — l)p n, (—l) r . Hence, we have for case II 
the value, if Ni denotes the number of solutions in case i, i = I, II, • • • , VII, 

N 2 =p n + l~d-k 1 + {k 1 +d- 2)p n >(-l) r . (7) 

Similarly for cases III, IV, etc., we have 

N 3 =p n + l-k 1 +(ki - 2)p nt (-l) r , ( 8 ) 

N 4=p ^ + l-d-k 2 + (k 2 + d- 2)p ni (-l) r , (9) 
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N 5 -- 

= p n + l-k 2 + (k 2 -2)p ni {-l) r , 

(10) 

n 6 -- 

= p n + l-d + jJ ni (-l) r (d-2), 

(11) 

Nt- 

= p n + l + (-l) r - 1 2p n '. 

(12) 

If Ni is the number of solutions of the equation 



ClXi 1 + C 2 X2 3 +C 3 = 0 


in x\ and X 2 , non-zero elements of a finite field F(p n ) of order p n , p an odd prime, 
with ci,C 2 ,C 3 given non-zero elements of F(p n ) with N“ * = 1,2, •• - ,7, the number 
of solutions in the corresponding seven cases numbered I, II, • • •, VII, given just above 
the relation (5); 0 < a< < p n — 1, (di,p n — 1) = A ： i, i = 1,2; m = dsjS 2 , dsi = 
ki, ds 2 = kz, (ki, A ： 2 ) = d\n = 2n\, p belongs to the exponent 2t modulo m, n\=rt 
mth p* = —1 (mod m), then the values of Ni are given by the relations (6)-(12), 
inclusive. 

Mithcell obtained these results for the special case when k\ = A ；2 by a more 
complicated method. 

We now use (4) to find closer limits for N in the special case when s = 2, than 
those found in C for equation (1) of that paper. Set 

翁:卵-:科 f ). 

Now every term on the right of (4) is real so that we may write TV < M + |A| 
and N ^ M — \ A\ and using the known result 


we have, if 


|r(x)l=P n/2 , X^Xo, 

(13) 

D = _ - l)(fe - 1) - (d - l))p "/ 2 |， 

(13a) 

N^M + D, 

(14) 

N^M-D. 

(15) 


To obtain limits in terms of rational integers only, divide the discussion into the 
seven cases I — VII defined in the proof of Theorem I. Then we can evaluate M exactly 
as in the determination of N in Theorem I in each of the seven cases®. 

① Asa check on the accuracy of our work, the present problem was discussed in a mathematical 
seminar, using a different method of proof. During the resulting discussion, Mr. Olin B. Faircloth 

made a suggestion, which, when used in connection with the method employed in this paper yielded 

a better superior limit for N, than that which we originally had. 








agrees with the limits found in Theorem II, when c\ = C 2 = C 3 , with the superior limit 
when r is odd and the inferior limit when r is even. For the case n = l,ki — an 
odd prime, this fact was noted by Mitchell, who also stated that Hurwitz's method 
could be extended so as to obtain limits for the case when n is general, which had a 
similar property. 











On the nature of the solutions of certain equations 
in a finite field* 

In two recent papers ① the authors considered the equation 

ci®?* + c 2 a ： 2 a + ••• + c s x a a + c t+ i = 0 (1) 

in the x's, where the a's are integers such that 0 < a < p n — 1; s > 2 for c s +i — 0, 
and s > 2 for c s+ i = 0 the c's being given elements of a finite field of order p n , p an 
odd prime, which will be designated by F(p n ); and 

Cic 2 - - - c a xiX2 . •. 〆 0 (2) 

in F(p n ). Limits were found for the number of solutions of (1) and it was proved 
that for p n sufficiently large (1) always had solutions under the restriction (2). In the 
statement above concerning (1) we made the provision that 9^2 for c s +i ^ 0. If we 
consider the case where s = 1, (1) reduces to the equation 

ci®i* + c 2 = 0, (3) 

with C]C 2 ^ 0. Since any finite field can be represented by means of the residue 
classes with respect to some prime ideal in an algebraic field, then examination of the 
solutions of (3) is included in the theory of the congruence 

x n = a (mod p), ⑷ 

where a is an integer in an algebraic field K, p is a prime ideal in that field and a ^ 0 
(mod p). The study of the last relation led to the theory of the class field and the 
laws of reciprocity for nth powers®. 

We wish to point out the sharp distinction between this problem and the problem 
of finding the solutions or the number of solutions of the equation (1) and when a ^ 2 
for Ct+i 0. 

* Reprinted from Proceedings of the National Academy of Sciences, 1949, 35 (8): 481-487 (By 
Hua L K and Vandiver H S.) 

① These Proceedings, 1948, 34: 258-263, this will be referred to as E, 1949, 35: 94-99, this 
will be referred to as C. 

② Hasse H. Ber, Deut Math Vereitl, 1926, 35: 1-55; 1927, 36: 233-311 ； Erganzwigsband ， 1930 




In addition to this ， we shall also obtain an expression (Theorem I) for the number 
of solutions of (1) when c s+ i ^ 0 and s = 2, which leads to the proof of Theorem 
I and which we shall also have occasion to make use of in another paper. Also we 
obtain Theorem II, which gives the exact number of solutions of (1) when the fc’s are 
prime each to each, c*+i = 0 and (di,p n — l)=fc{, i=l ， 2,. • • ， s. 

We consider equation (4) and confine ourselves to the case where if is an algebraic 
field which contains a primitive Zth root of unity with l prime. Then 

x l — a = 0 (mod p) (5) 


if and only if the Zth power character of a, modulo p is unity, or 



However, by a known theorems ① 



for an infinity of distinct prime ideals p provided a does not equal an 2th power of 
an integer in K. Now the last statement is contrary to what we have for s > 1 in (1) 
with c,+i # 0. For, as we have shown in E (p.262) the congruence 

ai®? 1 4 - a 2 x^ + ••• + + a,+i = 0 (mod p), (6) 


with 


Q 1 Q 2 • • • a a xiX 2 • • 'X a ^ 0 (mod p) 


for s > 2 with c a+ i 笋 。 (mod p) always has solutions if the norm of p exceeds a 
certain limit, in fact (6) always has at least k solutions in the x's for k any positive 

integer if the norm of p exceeds a certain limit. In particular when we consider the 

case where there are no solutions it is known ® that a: 7 4- y 7 + 1 = 0 (mod p) has no 
solutions in integers prime to p for just four primes p, namely 29, 71, 113, 491, but 
has solutions for all the other primes p. As for i 5 + y 5 + 1 = 0 (mod p) it has no 
solutions for p = 11, 41, 71, 101. Also : c 3 + y 3 + 1 = 0 (mod p) has no solutions only 
for p =7 and 13. It is also knoun® that for m < lOi, and p=l + ml and h(l) is the 
number of primes p for which 


a* + j/ 1 + 1 = 0 (mod p) 


⑺ 


Coll Sci, Imp. Univ. Tokyo, 1920, 42: 16. 
.Messenger Math., 1908, 38: 14-33. 
are due to Beeger N G W H, unpublished. 
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has no solutions for xy ^0 (mod p) then h(ll) =3, /i(13)=4, h(17)=4, fe(19)=6. In 
view of these facts it may happen that h(l) increases as l increases. Note also from 
the cases 3, 5 and 7 that there is a range of values for p in which the corresponding 
congruences may or may not have solutions. 

From the above considerations we may say that associated with any form of the 
type given by the left-hand member of (4) there is an integer e(A) such that there are 
exactly e(fc) values of p such that (4) has A: solutions. Even in the case where k is zero, 
the class of forms satisfying the condition may be rather general. For example, ① it 
was shown that if c is a prime and m is an integer such that p = 1 + me with p prime 
then 

OiiT* + 023:2* + ••• + a,i7* = 0 (mod p) 

has only the solutions x x = x 2 = ■ ■ • = x, =0 (mod p) provided s ^ c - 2 the sum of 
no n of the a's is = 0 (mod p), 0 < n ^ s, and 

(|ai| + M + -- + K|)^ (c) <p, 

where <j>{c) is the indicator of c. Select ai = 02 = • • • = a«-i = 1 and a a — —s, and 
also an m and p with p=l + mc with c fixed and satisfying the other conditions just 
mentioned then it is evident that 

+ + 1 ™! -32^=0 (mod p) 

has no primitive solutions (incidentally, in view of Theorem III of the article just 
referred to, the equation 

xT + x^ + '-'+x^-sx^ 1 =0 

has no solutions in rational integers unless each 1 is zero, if there is a prime p with 
p = 1 + me and (2s - < p). 

We now recur to the subject of the number of solutions of equation (1), treated 
in E and C. The case s = 2 with c 3 # 0, has received considerable attention in the 
literature •② Let us examine 

C 1 X 1 ' + C 2 X 2 3 + c 3 = 0, ⑻ 

with 

X 1 X 2 C 1 C 2 C 3 ^ 0 (8a) 

① Vandiver, H, S. these Proceedings, 1946, 32: 101-106. 

(2) Cf. footnote 1 of E. 
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Theorem I of C, we see will give, if JV is the number of solutions of ( 8 ) in Xi and 
under the conditions ( 8 a), 


where we may write T in the form, if ki = (aj,p—1), hi ranges over 0,1, • • ■ — 


over 0 , 1 , … ,ki — 1, 


EE (9a) 

%o ⑽ 

where Xk, is a special character defined by fci or as we shall say, a A ： i-character, and 
similarly Xk a >s a special character defined by ki and where, if we write K for F{p n ), 

r(x,C) = r(x)=S x(6)C trW ； C = e ㈣， (9b) 

b€K 

with x(0) = 0. This is obtained from the fact that if a is a primitive (p n — l)th root of 
unity then ind °, where a ranges over all elements of F{p n ), ^ 0, gives a character 
as defined in C where ind a is given by 


ff ind a = o, 


and where g is a primitive root in F(p n ), so we obtain all characters by taking fi 
0,1, ••- ,p n — 2. We have from C the relations, is xo * 8 the principal character, 


|t(x)I = P n/2 ； X ^ Xo, 

r(xo) = -1- 


( 10 ) 


To reduce (9a) we divide the discussion into four cases. 

First Case. 说 =l，xg = 1. In this event x *； (a -1 ) = X^(a _1 ) = 1. XkJ (cf 1 )= 
(c^ 1 ) = 1, which reduces (9a), using (10), to 

^^tr(aca) = _J (11) 

Second Case, x *, 1 = 1 ； Xkl ^ 1 - This gives from (9a) and (10), 

- E T « X ^ la_1 )^ r(aC3) = 5> t ( 认)从 ㈤ 1 )“⑷ r 1 (咖 3 )， 

a a 

and setting ac^ = —ai, using C 3 ^ 0 , this reduces to 

- E T Ukl)Xkl (-C2 ^3)(^= -^ T (Xfc»MXfe 2 )Xfc J (- 碧 ) > 
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and using the known relation 


(11a) 

this is seen to equal 

P2 ' ^ 

(12) 

Also, if 一 1 and x 1 ^ = 1 we find in a similar way that the result is 



W ㈢. 

(13) 


Third Case, x'k.x'k, = 1 with x*, 1 ^ l.Xfc* + 1, (9a) reduces to 

(d m (CX X ， oc tr(ocs) . 

a Ml 

Now 

r(Xfcr-C) = r(xrr- r 1 )X*,(-l)= ^xST^(-l)- 

Hence for the admissible values of m , (9a) reduces to 

-E^(-|)p"- (14) 

Ml ' 7 

Now we must determine the possible values of Hi from = !■ Let a be a 

primitive (d«iS 2 )th root of unity where dsi = k\,ds 2 = 1^2 and where (A ： i, A 2 ) = d, 
whence (si,s 2 )= 1. Then Xfc, l Xfcj =1 gives integers /ij and fi 2 such that a *i>*i+»aMa = i 
or S 2 fii + 81 H 2 = 0 (mod (dsi« 2 )) so that m = v\S\,H 2 = W 2 « 2 - Hence the possible 
values of 卩 1 are Siur, Vi = 1 ， 2, ... ,d - 1, as the case ui = 0 was excepted. Hence 
(9a) reduces to, for this case, 

- L (-|)»" ( 15 ) 

Fourth Case. Here ^ jt 1, Xk\ ^ 〆 1. Consider such terms in (9a). 

They may be written in the form 

E E {: >-)C 咖 3 ) 成 (0<0^). (16) 

/ <i>^2 a6if 
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Write oi - ac 3 , then a~ 1 — and using C 3 _ 0, 

a a\%0 aiEK 

aiSK 

Hence using (1 la), we find from (9a) that the terms in case 4 in T may be written as 

㈤ 

xll ^ 1, X^l ^ 1. xTXl ^ 1 - Using (9), (9a), (11), (12), (13), (14), (15) and (17), 
we find 

+ (-如 ㈢ 纖 f ， ⑽ 

where, in the last term, Xk , 1 ^ 1 . ^ 1 . XfcjXfc, ^ 1 - 

Whence we have the (cf. Davenport-Hasse ①). 

Theorem I If N is the number of solutions in xi and X 2 of 


CiX^' + C2X% 2 + C 3 = 0, 

with ci, c 2 , C3, x\, ®2 in a finite field F (p n ) of order p n with p an odd prime, cic 2 c 3 a ： ia ：2 
^ 0 in F{p n ); (ai,p n - 1) = fci, 0 < fci < p n - 1, (a 2 ,p n - 1,) = * 2 , 0 < fc 2 < 
p n - 1, Xfc, is a special ki -character, Xk 3 w o special kz-character, d = (fci,fc 2 ), d«i = 
ki, r(x) is defined as in (9b), then the relation (18) holds. 

① The proof of this theorem we gave as a direct application of our general theorem I of C 
to the special case s = 2. Another proof may be obtained by using the result of Davenport and 
Hasse, Jow fiir Math (Crelle) 172, p.174 relation (6.5) which gives the number of solutions of 
a« m + by n — c,abc / 0 in F(p n ), where x and y are nor restricted, as they were in our work, to 
non-zero values in F(p n ). 1V> do this we consider the possible solutions involving zero and subtract 
the number of them from the right-hand member of the relation (6.5) just referred to. We note that 
the number of solutions of ax m = c and bj/ n = c are, respectively, 

gv©; go. 

which when taken in connection with (6.5) yields a formula equivalent to our relation (18). 
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We shall now obtain N explicitly in terms of p, n and s, for special values of k\ 
when c s +i = 0. By the remark immediately following (9b), we may write 
lnd(o _, ) 


X M k ： (a- 1 ) = c^ 1 


(19) 
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we have a unique j/i, therefore, we have 

N s = (p n - 1 广 1 - Ns^. 

Consequently 

N a = {p n - l)- 1 -(p n - 1 广 2 + ••• + (-l)*(p n - 1) 

We also have, since the residue classes with respect to a rational prime modulus 
p form a finite field of order p, the 

Corollary I If N is the number of incongruent solutions in the y's of the 
congruence, with p an odd prime, 

dil/? 1 + +... + deVs' = 0 (mod p), 

where did 2 •■■d 3 异 0 (mod p), yiy 2 … y* 笋 0 (mod p); ki,kz, ■■- ,fc* prime each to 
each with (a<, p — 1) = A:<; 0 < < p — 1, * = 1,2, • • • , s, then 

((P-l)*' 1 + (-!)*)• 

P 

It may also be noted that the procedure used in the second proof of Theorem II 
gives us a method for determining the solutions of (22). 



Some properties of a sfield* 

All the results of this paper are initiated from the almost trivial identity that if 
ab ^ ba, then 

a = (6 -1 - (a- l) -1 6 _1 (a - l))(a -1 6 -1 a -(a - l) _1 6 _1 (a- 1)) _1 . (1) 

Among these results, there are two interesting ones which are the perfect form of two 
theorems due to H. Cartan ① and J. Dieudonne ® respectively. 

Theorem 1 Every sfield is generated by a non-central element and its conju¬ 
gates. 

Let K be the sfield and L be the conjugate set. The identity (1) asserts that if 
there is an element b of L such that ab ^ ba, then a belongs to the sfield K\ generated 
by L. That \s, K — K\ and Ki are commutative elementwise. 

Suppose our theorem is false; we have an element a oi K — K\ and two elements 
b and b' of K\ such that bb' ^ Vb. Since a and ab belong to K — K\, we have 
(ab)b' = b\ab) = al/b, which contradicts bbf ^ b'b. The theorem follows. 

An immediate consequence of Theorem 1 is the following interesting result: 

Theorem 2 Every proper normal subsfield of a sfield is contained in the center. 

H. Cartan proved the theorem under the assumption that the rank of the sfield 
over its center is finite. His proof is far more complicated than the present one. 

More precise results can be obtained by specializing the subgroup. 

Theorem 3 Every sfield, which is not a field, has no proper subsfield contain¬ 
ing all ita commutators. 

It is enough to prove the existence of a commutator which does not belong to 
the center. Suppose ab ^ ba. Ftorn a modification of the identity (1)： 

a = (1 - (o - - - (a - 1 广 W—Ha - l)b)~\ (2) 

we deduce that at least one of o -1 6 -1 a6 and (a — l) -1 b -l (o — l)b does belong to the 
center. 


As corollaries of Theorem 3, we have 



(J) Cartan H. Ai 
② Dieudonne. » 





Theorem 4 If the center of a sfield contains all the commutators of the sfidd, 
then the sfield is commutative. An element of a sfield commutative with all the com¬ 
mutators of the sfield belongs to the center. 

Moreover, as a simple consequence of 

a -1 c -1 ac = o -2 (ac -1 ) 2 c 2 (3) 


and Theorem 3, we have 

Theorem 5 Every sfield, which is not a field, has no proper subsfield contain¬ 
ing all its square elements. 

Theorem 6 If the center of a sfield contains all the square elements of the 
afield, then the afield is commutative. An element of a sfield commutative with all the 
square elements of the sfield belongs to the center. 

By means of an identity in the theory of finite differences: 

A fc-1 x fc = A:! (: c + 盖 (fc — 1)), 

where A/(i) = f(x + 1) - /(i) and A'f{x) = △(A i_1 /(;c))，we have 

Theorem 7 Every sfield of characteristic > k, which is not commutative, has 
no proper subsfield containing all its k-th power elements. 

Let us denote a~ 1 b~ 1 abby (a,b). An r-commutator (r ^ 2) is defined inductively 
by the relation 

(oi, •• - ,o r ) = (ai,(a 2l - .Or))- ⑷ 


Let C r be the set of all r-commutators of the sfield. 

Theorem 8 Let 2. Every sfield, which is not commutative, has no proper 
subsfield containing all its r-commutators. An element of a sfield which permutes with 
all the r-commutators of the sfield belongs to the center. 

Let A r and B r denote the two propositions of our theorem. B r follows from A r 
immediately. Theorems 3 and 5 assert that both A 2 and B 2 are true. We shall prove 
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central series can never go up. It is a comparatively difficult question about the 
derived series. Elsewhere I shall prove 

Theorem 9 Let 1. Every sfield ia generated by all the elements of the r-th 
derived group. 

An application of Theorem 1 is to establish the simplicity of the projective special 
linear group PSL n (JiT) ① of dimension n over the sfield K. 

Theorem 10 The projective special linear group PSL n (KJ is simple except 
when n = 2 and K has 2 or 3 elements. 

In case K is a field of characteristic ^2 or K is a complete field, the theorem was 
proved by Burnside-Jordan-Dickson (see Dickson , ② p. 85 and Van der Waerden , ③ 
p.7). For the incomplete field, it was proved by Iwasawa ④. For a sfield, Dieudonne 
proved the theorem for n > 2 , and for n = 2 ⑤ except in the case when the center of 
K has 2, 3 or 5 elements. Since both exceptional cases in Theorem 10 are not simple, 
it is the final answer to the problem about the simplicity of PSL„(/sT). 

Owing to the previous known results, we assume, from now on, that n = 2 and 
the sfield is not commutative. It is the same thing to prove that 

Theorem 11 Every normal subgroup of SL 2 (K) is contained in the center, 
except when K has 2 or 3 elements. 

Let A be an element of SL 2 (/T) which does not belong to the center. We are 
going to prove that A and its conjugates generate SL 2 (/C). We shall prove that there 
is a matrix Q belonging to SL 2 (K) such that QAQ~ X has a non-zero element at (1,2) 


position. In. fact, let A = 


a P 


I. If 峙 0 ， Q = 1 satisfies our requirement, and 


if 7 一 0, then Q = 


does. If = 7 = 0, we have an element x such that 


ax. xS, since A does not belong to the center. Then 


( 1 x \ 

( a 

f 1 

(* —ax + x6 \ 

lo ij 

lo s) 

V« 1 J 

.1* * ) 


We may assume that the subgroup generated by A and its conjugates contains an 

① For the definition and properties of PSL n (/f), see Dieudonn 冶.④ 

© Dickson L E. Linear Groups. Leipzig, 1901. _ 

® Van der Waerden B L. Gruppen von linearen Tranaformationen. Berlin, 1935: 7. 

④ Iwasawa M. Proc. Imp. Acad. Japan, 1941, 17 : 57. 

© The author had some difficulty in understanding Dieudonne's proof. In fact, all the parabolic 
elements of PSL 2 ( / <■) do not form a single conjugate set in PSL2(Jf). 


where k belongs to the commutator subgroup of K, so that d belongs to 0 L 2 I 


聊， ，毕卜厂 十， ）• 

Now we shall prove that we can choose « such that —0^k does not belong to 
center. For otherwise (―/ 37 « 1 ) - 1 (— ^ 7 ^ 2 ) = belongs to the center for 

two commutators and ^ 2 , that is, the commutators of k belong to the centf 
K. By Theorem 4, X is a field which contradicts our supposition. Therefore 
subgroup contains an element of the form C = ABA -1 B~ l = ( ° ] , whe 

\ c d J 

does not belong to the center. We have an element A of such that dA — Aa,' 
the subgroup contains 

/10 \ . o 

We may suppose that our subgroup contains ( | , c 一 0. 


lg to prove that the subgroup contains ( 丄 ^ J . la tact, 11 c = ± 1 , 

trivial. Let us suppose ±1. The field Ki, obtained from the 

idiunction of c, contains more than 3 elements. SL 2 (A"i) is generated 


conjugates generate SL 2 (i^i)- 
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1 o 2 A 2 a 2 + a 8 
. 0 


+ a 8 + a \ 

1 r 

° ) 

- i )- 2 ) 


(a+l ) 2 

0 (a+l)— 2 , 


we deduce that ( 。 J belongs to the subgroup. Since ^。 J and its conju¬ 
gates generate SL 2 (/^) as a runs over all elements of K, the theorem follows. 

The previous method can be used to establish the easier 
Theorem 12 Every normal subgroup of the general linear group GL n (l<r), 
which is not contained in the center, contains SL„(/T), except the cases when n = 2 
and K has 2 or 3 elements. 


consideration. Consequently the subgroup contains 

for all A. 

If the characteristic of the sfield is not equal to 2, then the subgroup contains all 
the elements of the form 


( 1 o\ 

/ 1 0、 _ 

7 1 

-( 1 °\ 

V(A + ” 2 ij 



u 入 w 


Therefore we have the theorem. 

If the characteristic of the sfield equals 2, since 

(1 (A + 1)- 2 V 1 0Vl lV 1 o\ = /(A + i)- 2 0 \ 

\0 1 八 A 2 1 八 0 1 八 (1 + A-”_ 2 1 广 、 0 (A + l) 2 / 

the subgroup contains elements of the form 

(o : -0 

for all a. Let a be any element of the sfield. From 


(1 a 、 




(叫 - 1 

卜 2 

0 ) 

U 1 

u 

«- 2 J 

U i J 

U ij 

l 0 

a 2 J 





On the generators of the symplectic modular group* 


Introduction 

Let n be a positive integer. Throughout this paper, unless the contrary is stated, 
we shall use capital Latin letters to denote n-rowed matrices and capital German let¬ 
ters to denote 2n-rowed matrices. Furthermore, an r-rowed matrix R will be denoted 
by Let 



where 1 and 0 denote the identity and zero matrices respectively. Let T be the group 
of all matrices £01 with rational integral elements which satisy 

OT 卿 , =(1) 

where denotes the transpose of OT. Let ro be the factor group of T over its centrum; 
r。is called the symplectic modular group. It can be thought of as being obtained 
from T by identifying the elements 971 and -OTl. In applications to modular functions 
of the nth degree® and to the projective geometry of matrices® it is customary to 
identify SOI and —as a single transformation. For this reason we have considered To 
rather than T; it might be pointed out, however, that the generators of To obtained 
in this paper happen to be a set of generators of T. 

It is the aim of this paper to find the generators of the symplectic modular 
group. It will be proved here that this group is generated by two or four independent 
elements, according as n = 1 or n > 1. The method used here can be extended so 
as to give a set of generators for matrices with elements in any Euclidean ring. In 
particular, we give the details for the generalized Picard group at the end of this 
paper. 

*Presented to the Society, February 28, 1948; received by the editors March 6,1948. (By Hua L 
K and Reiner I). Reprinted from Tranaactivns oj the American Mathematical Society, (1949, 65(3): 
415-426.) 

① Siegel C L. Math. Ann, 1939, 116: 617-657. 

② Hua L K. Trans. Amer. Math. Soc., 1945, 57: 441-490. 
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Problems of this type have been considered previously. Poincare® stated without 
proof that every matrix with integral elements for which = ®, where 0 is 


_ product 

set of matrices. FVom the results given in the present paper, a much stronger form of 
Brahana’s result can be easily deduced. 


(1) is equivalent to 


AB' = BA', Ciy = DC\ AD'- BC'= 


By taking inverses of both sides of (1) and using 衣一 1 = —5, we can deduce that 
= 5, so that 

A'C = C'A, B'D = D'B, A'D- C'B = I. (4) 


We shall begin by showing in §3 that T 0 is generated by the following types of 
elements: 

⑴ Translations: 

1 

where 5 is symmetric. 

(II) Rotations: 

« = 

where U is unimodular, that is, abs U=1 (where abs U denotes the absolute value of 
the determinant of U)- 





① Poincare H. Rend. Circ. Mat. Palermo, 1904, 18: 45-110. 

② Brahana H R. Ann. of Math, 1923, 24 ⑵： 265-270. 




wnere J is a diagonal matrix wnose diagonal elements are u s ana 丄 ’s’ so tnat = J 
md (/ - J) 2 = / - J. 

It is easily verified that matrices of types I, II and III satisfy (1). 

2. In this section we prove two lemmas on matrices. 

Lemma 1 Let m he a nonzero integer, and let T be an n-rowed symmetric 
matrix at least one of whose elements is not divisible by m. There exists a symmetric 
matrix S with integral elements such that 

0 < abs(T- mS)< |m| n . (5) 

Proof The lemma is evident for n = 1. Consider next n = 2; let T = {Uj), 
S = (Sij). Then 

ab8(r — mS) = |(tn — msu)(t22 — OW 22 ) — (*12 — m8 i2) 2 |. ⑹ 

If m divides both in and & 2 , it cannot divide < 12 ; we can then choose S so that 
tn — msu = <22 - m «22 = 0 and 0 < |<ia - msn\ < |m|. Suppose on the other hand 
that m does not divide one of the diagonal elements, say <n. Fix «i 2 arbitrarily, and 
choose si 1 so that 0 < Ifn — msn| < |m|. Since (6) can be written as 
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where - - - represents terms not involving Skk- Choose Sik arbitrarily for l = 1,2, , 

A: — 1,A: +1, ••- ,r. Then by the Euclidean algorithm we can choose Sfcfc so that 

0 < abs(T — mS) ^ |m|abs(Ti — mSi) < |m| r . 


This completes the proof of the lemma. 

Lemma 2 Let A and B satisfy AB' = BA' and let det 0. There exists a 
symmetric matrix S such that either 

B - AS = 0, (9) 

or 

0 < abs(5 - AS) < absA (10) 

Proof Prom AB' = BA' and det i4 ^ 0, we may deduce that A'B is sym¬ 
metric, where A" denotes the adjoint of A. We apply Lemma 1 with T = A* B and 
m =det A. Either every element of A*B is divisible by m, in which case there exists 
a symmetric matrix S with A m B = mS, or else there exist symmetric matrices R 
and 5 such that A*B = mS + R with 0 < abs < |m|". In virtue of the relation 
A*A = ml, these alternatives become: either B = i45 (in which case (9) holds), or 
B — AS — AR/m; however, 

L AR (abSi4)(abs/2) _ absiZ 

ab8 "^" = ~ \m \^ _ = hF 1 ' 

so that 

0 < abe(fi - AS) < |m| = absA 

3. We are now ready to show that To is generated by matrices of types I, II and 
HI. Let 9JI given by (2) be an arbitrary element of To- It suffices to prove that by 

multiplying 9Jt by matrices of types I, II and III, one obtains a product of matrices 

of those types. 

(3) implies that not both A eind B are 0. Since 


(A 


( 0 


(- B A ) 

u 

D) 

u 

oj' 

\ * *) 


we may assume that A has rank r > 0. Furthermore, 

(u 0 W A B\(v 0 U UAV * 

( 0 t/，- 1 八 C D 八 0 V ,-1 j _ V * * 







alternative occurs, we proceed as follows: let 


( 13 ) 



Then 

A B 
C D 

where 

~A = AJ-B(I-J)=y ° J . 

We now repeat the process as before, and so on. Since there are only finitely many 
positive integers less than abs Ai, this process eventually terminates. Thus, by mul¬ 
tiplying 9K by matrices of types I, II and III one arrives at a matrix 
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and det iZ 一 0. One readily deduces from AqBq — BqA!q that Bo must be of the form 


/ W A) B 0 、 （ 

0 J { Co Do )\. 


where J is given by (13). Finally we notice that for a matrix 


of To, we must have A = U unimodular, D = U'~ l , and thence from (3), B = SU'~ l 
with symmetric S. Therefore 


^) = (^)(o 


This completes the proof that T 0 is generated by the matrices of types I, II and III. 

4. The set of matrices of types I, II and III which generate r。are certainly not 
independent generators. Let us reduce the number of generators as much as possible. 


^■)(ot) = (o 


the subgroup formed by matrices of type I is generated by those type I matrices whose 
S's are given by 
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so that the group generated by matrices of types I and II is the same as that generated 
by all type II matrices and the two translations whose S's are given by (15). However, 
we have 



Hence the translation with Si is obtainable from that with So and the matrices of 
type II. Therefore To is generated by the matrix 




0 



(16) 


with So given by (15), and all matrices of types II and III. 
Since 


(u 0 N 

0 \ 

J uv 0 ) 

V 0 u '- 1 ) 

V 0 V '- 1 ) 

、 0 {uvy - 1 ) 


in order to find the generators of the subgroup of rotations we have merely to find 
the generators of the group of unimodular matrices. These are given by the following 
theorem. 

Theorem 1 Let n ^ 2. Every unimodular matrix with rational integral ele¬ 
ments is a product of the matrices U\, U 2 , U 3 and their inverses, where 



(17) 


Proof It is known ① that every unimodular matrix is a product of Ui,Uz, U 3 , 
① See for example, MacDufBee C C. The theory of matrices. Berlin, 1933: 34, Theorem 22.5. 
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t；i(r- 1 ) n - 2 c/f 1 r n - 


From this we see that 


T n ~ 3 Ui(T~ 1 ) n ~ i Ui 1 T n ~ 


(T~ 1 ) n ~ 2 UiT n ~ 3 Ui(T~ 1 ) n ~ 2 U^ 1 T n ~ 1 x= r 2 + ri 


i/t = c7f 1 (r _1 ) n-2 t/ir n - 3 c/i(r -1 ) n_2 tif 1 T n_1 . 


where + denotes the direct sum of matrices. But from 


- u ^=(\ -： 
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This completes the proof of the theorem. 

Corollary Let 2. Every unimodular matrix with rational integral elements 
of determinant +1 is a product of powers of U 2 and 

/ 0 ••- 0 (-l) n_1 \ 

1 … 0 0 

u,= ： : : • 

0 ••• 0 0 

\ 0 ••- 1 0 ) 


By Theorem 1 we see now that To is generated by To and the set of all semi- 
involutions and the three rotations defined by 





Ui 0 

0 u ;- 1 


i = 1,2,3. 


(18) 


We finally consider type III matrices. Let J r be the diagonal matrix obtained 
from the identity matrix by replacing the rth 1 by 0. In that case, if r ^ a, we have 


/ Jr I~Jr\ 

( V 

_ ( Jrs I~Jrs\ 

\ Jr~ 1 Jr ) 

V Je-I Js J 

\ Jr.~ I Jr.) 


where */ r , is obtained from the identity matrix by replacing the rth and sth ones 
by 0,8. Therefore, in order to obtain all type III matrices, we need only those semi- 


= 1 , 2 ,- 


(19) 


involutions 、 

/ Jr I-Jr 
\Jr-/ Jr , 

with J r defined above. Now, let U be that unimodular matrix obtained from I by 
interchanging the 1st and rth rows; then we have 


(u 0 \ 

(Jr I-Jr\ 


Jl 

\ 0 U'~ l ) 

\ Jr~ T J r ) 

V 0 ^ J 

\Jl-I Jl J 


Therefore I'd is generated by the matrices To, 9li (i = 1,2,3) and the matrix 


©0 


Ji 

Ji-I 



(20) 


with J, previously defined. But 


60 = «3, 



so that 5 H 3 maybe dropped from the list of generators. Therefore we have the following 
theorem. 

Theorem 2 Tq is generated by the four matrices To, 9 li, 9 l 2 and ©0 given by 
(15), (18) and (20), for n > 1. For n = 1, To ts generated by To and ©o- 

5. In this section we shall prove the independence of the generators given in 
Theorem 2. For n = 1, this is trivial because 60 is of finite order while To is not. 
Hereafter we suppose that n > 1. 

(1) Independence ofXo. We consider the transformation 

(U) = (x ， y)OT; ( 21 ) 

if XY' is symmetric, it is easily verified that X\Y{ is also symmetric. We shall show 
that if the diagonal elements of XY' are even, those of X{Y{ will also be even if is 
汛 1 , 识 2 , or 60 , while if OT = To it is possible to choose X and Y so that some diagonal 
element of X{Yl is odd. This will show that To is not expressible as a product of 9ti, 
JH 2 and @o and their inverses. 

Assume now that the diagonal elements of XY' are even. FVom (21) one readily 
deduces that if 371 is a rotation, X{Y{ = XY', so that the diagonal elements of X\Y{ 
are also even. If secondly SOI is a semi-involution, we have 

Xi = xj + y(i-j), yi = -x(/-j) + yj, 

so that 

X^Yi = XJY' - Y(I- J)X' = XJY' + YJX' - YX'. 

Since XJY' ia the transpose of YJX', it is again clear that the diagonal elements of 
X{Yl are even. Finally, Suppose OT = To- Then we obtain 

X{Yi = X(XS 0 + YY = XY' + XSqX' 

and for X = / the first diagonal element of X{Y{ is odd. This completes the proof of 
the independence of 5 0 - We may remark in passing, however, that Xg is expressible 
as a product of the powers of 9li, 912 and ©o- 

(2) Independence of SRi- Let t = (n,- - - ••- , »„/ be a column vector 

with 2n components. It is clear that the second component r-i is unaffected when r i8 
multiplied on the left by any of the matrices I 。， ^ 2 , and ©0 and their inverse. Under 
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(3) Independence Multiplying t on the left by or 6 。 or their inverses 

permutes components of t; under any such permutation, however, any n and its 
corresponding Sj remain n components apart. Since the effect of multiplying on the 
left by To is to replace q by n + Sj, it is clear that by multiplying t on the left 
by a product of IRi, ©o and To and their inverses, ri may be replaced by a linear 
combination of r\ and Si and its position may be changed. It is however impossible 
to replace n by ri + in this way, and this is exactly the effect of multiplication of 
t on the left by W 2 . This proves the independence of 912. 

(4) Independence of&o- We note that 



Since Jo ，讲 1 and W2 and their inverses are all of the form 

(: :) 


and 60 is not of this form, it is clear that 60 is not expressible as a product of To> 
and IW 2 and their inverse. 

6 . Our previous method can be extended to any Euclidean ring; in particular, 
for the ring formed by the Gaussian integers, we have the following result: 

Theorem 3 Let T 7 be the group of matrices DJI with Gaussian integers as 
elements which satisfy (1). LetT r 0 be obtained from, T' by identifying the four elements 
± VJl and ± Then for n > 1, Tq w generated by the matrices ^2 and 

So defined previously, and the matrix 



where Si = iSo- 


( 22 ) 


For n = 1, Tq is generated by lo. and ©o- 

The independence of the generators is shown as follows (with suitable modifica¬ 
tion when n = 1 ): 

(1) Independence o/X。. We use the method of §5, (1). Let Xy 7 be a symmetric 
matrix with Gaussian integers as elements, such that the real part of each diagonal 
element is even. This property is preserved when (X,Y) is subjected to the trans- 
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(2) Independence ofli ，This is clear since is the only generator which is not 

real. 

(3) The independence of Wi, W 2 and ©0 follows exactly as before. 
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Geometry of symmetric matrices over any field with 
characteristic other than two* 

1. Introduction 

It is the aim of this paper to establish a generalization of von Staudfs theo¬ 
rem for the geometry of symmetric matrices over any field with characteristic 笋 2. 
Before stating the theorem explicitly, we explain some notations which will be used 
throughout the paper. 

Let $ be a field with characteristic ^2. We use capital Latin letters to denote n- 
rowed matrices over and I and 0 to denote the n-rowed identity and zero matrices 
respectively. For matrices which are not n x n, we use A/(’， m ) to denote an Z x m 
matrix and M( TO > = M' denotes the transposed matrix of M. We use, also, 

small Greek letters a and r as superscripts to denote automorphisms of the field 
that is, for any two elements a and b of we have 

(a + b)° = a° + b a , (aby = a°b a . 

Let 



A pair of matrices (X, y) is called nonsingular and symmetric, if (X,Y) is of 
rank n and 

(x ， y)s(u)' = o. ( 1 ) 

We shall say that two nonsingular symmetric pairs (X,Y) and (Xi, Y{) belong to the 
same class, if and only if there exists a nonsingulax Q such that 

( 不，叫 = 0( 义 ，50 (2) 

holds. A class of nonsingular symmetric pairs of matrices will be called a point 
P = V")}. The totality of these points forms a projective space of symmetric 

TTllltT%CCSi 


* Reprinted from Annals of Mathematics, 1949, 50(1): 8-31. 
Received February 6, 1947, Revised October 7, 1947. 
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The rank of 

<p, 朽 > = (x，y) 苫 (uy 

is called the arithmetic distance between the two points P and Pj represented by 
(X, Y) and (Xi, yi) respectively. It is denoted by r(P,Pi). Evidently this notion is 
independent of the particular choice of the elements from the same class. It is also 
obvious that two points coincide if and only if their arithmetic distance is zero. 

Our problem is to find all the transformations carrying the projective space of 
symmetric matrices onto itself and keeping arithmetic distance invariant. 

Evidently, the symplectic transformations 

(Xx ， K) = Q(X, Y)T, Tyr' = 5 ， (3) 

satisfy our requirement. But this does not tell the whole truth; in fact, the transfor¬ 
mations 

(X u Y 1 ) = Q(X ir ,Y a ) (4) 

also satisfy our requirement. 

In order to find the full truth, we have to extend the notion of the symplectic 
group. We consider the transformation 

(Xi ,yi) = Q{x, y)i, lyr' = ay, (5) 

where a is an element ^ 0. Further, since 

{bZ^bV)' = b 2 aS 

and 

(X 1 ,Y 1 ) = Qb~ 1 (X,Y)b% 

only those a which represent different classes of the factor group of the multiplicative 
group of $ modulo the multiplicative group of all square elements of $ are essentially 
taken into consideration. The transformations (5) are called extended symplectic 
transformations, and they form a group to be called the extended symplectic group. It 
is apparent that the factor group of the extended symplectic group over the symplectic 
group is isomorphic to the factor group $/ 少 2 . 

It will be proved in this paper that, for n > 1, 

Euery transformation carrying the projective space of symmetric matrices of or¬ 
der n >1 onto itself and keeping arithmetic distance invariant is of the form 


(Xj^) = Q(X?,Yn% 


⑹ 
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where Q is a nonsingular n x n matrix, a is an automorphism o/ X is a 2ra x 2n 
matrix satisfying 

= aS 

and a runs over a complete residue system of 企 /$ 2 . 

In nonhomogeneous coordinates (Z = y _1 _?f) ， (6) takes the form 

Zi= a(AZ° + B){CZ a + D)~\ ⑺ 

where AB' = BA\ CU = DC', AD' - BC' = I. 

If, in particular, $ is the real field, then it has no automorphism other than 
identity, and +1 and —1 are the representatives of the factor group $/# 2 . Therefore 
we deduce that any transformation carrying the projective space of real symmetric 
matrices of order n > 1 onto itself and keeping arithmetic distance invariant is of the 
form 

Zx = {AZ + B){CZ-¥D)~\ 
where AB' = BA', CD 1 = DC', AD' - BC' = ±1. 

A comparison with the theorem stated in I® reveals that we have eliminated the 
conditions on “continuity” ， “sense” and “harmonic separation”. This is the final goal 
of our algebraic treatment. 

Notice that, for n = 1, the theorem is no longer true. An additional condition 
concerning harmonic separation is needed. For a field of characteristic 2, the theorem 
is not true for n = 1 even with the condition on harmonic separation. 

Before proceeding to the text, it may be worthwhile to mention several connec¬ 
tions of the theorem with various fields of mathematics. 

Algebraists may have some interest in the following theorem which is suggested by 
the “affine geometry" of symmetric matrices. Consider only the “finite” points {X, y); 
that is, pairs for which Y is nonsingular. They can be represented in nonhomogeneous 
coordinates Z (= y -1 X). The arithmetic distance between two points Z and Z\ is 
simply the rank of the difference Z — Zi- Then we have 

Let E be the set of all symmetric matrices of order n >2 over the field $ with 

characteristic #2. Any mapping of E onto itself leaving the rank of the difference of 

two matrices invariant is of the form 

① Geometries of Matrices. TVans. Amer. Math. Soc., 1945, 57: 441-481. In particular, p.4S9. 
The papers under the same title published in Trans. Amer. Math. Soc., 1945, 57: 441-481; 1947 ， 

61: 193-228, 229-255 will be referred as I, II and III respectively. Compare also C. R. {Doklady) 

Acad. Sci. URSS (N.S.), 1946, 53: 95-97. 
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Z x = aAZ^A' + S, S = S', (8) 

where a 6 $ and a is an automorphism of 

This theorem assures the algebraists that their study of symmetric matrices 

under the congruence group is really a very general one. We shall proceed to prove 

this theorem by purely algebmc methods, and then we shall deduce our fundamental 

theorem of the projective geometry of symmetric matrices from it. 

From the geometrical point of view, the case n = 2 is very interesting. Let 
(ij, 12 , 2 : 3 ! X 4 ) and (yi,y 2 .V 3 j Va) be two distinct points of ordinary projective 3-space. 
We use 

(X,Y)=( Xl X2 X3 XA ) 

\ yi 2/2 V 3 i/4 / 

to denotet the line passing through both points. Two pairs of matrices (X, Y) and 
(_Xi, y*i) represent the same line if and only if we have a nonsingular matrix Q such 
that 

(X,Y) = QiXuYt). 

The lines (X, Y) which satisfy 

(X,T)S(X,Y)' = 0 

form a line complex. Thus a point of the projective space of 2-rowed symmetric 
matrices denotes a line on the complex. For two lines (X,Y) and [X\,Y\) the rank 
zi ( 久， y)ff(A"i ， yi)’ is either 0 or 1 or 2; its significance is that the lines coincide, 

二 tersect，or do not intersect, respectively. Therefore the present statement in its most 

cpecific form gives us a rational treatment of the line geometry on a line complex. It 

seems also to be new. Notice that the line geometry on a line complex iB identical 

with Lie’s geometry of circles and that Laguerre's geometry of circles is the “affine 
geometry" of 2-rowed symmetric matrices. This indicates still other applications of 
both the homogeneous and nonhomogeneous forms of our theorem. 


2. Subspaces 

In §2~§6, we consider only the finite points of the projective space of symmetric 
matrices. Thus there will be no confusion if we use a symmetric matrix to denote a 
point. Throughout §2~§4, we assume that n = 2. 
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forms a subspace spanned by S and T which will be denoted by 11(5, T). 
We have a transformation of the form ( 8 ) carrying 5 and T into 


0 , 


: To 0 

0 0 , 


Xo ^ 0 , 

respectively. Thus 11(5, T) is equivalent under ( 8 ) to 


n H -； 


I y 

、y z , 


be a point of II 


implies 




r(0,P)<l, 


U 斗 


xz -y 2 = [x- Xq)z — y 2 = 0. 


Consequently we have z = 0 and y = 0, that is, 
matrices of the form 

(；：) 


-K ： o)) 


consists of all the 


⑼ 


Therefore every subspace can be carried by a transformation of the form (8) into 
)which has the explicit form given by (9). 


n W 二 : 


Two subspaces are said to be complemented if they have one and only one point 
in common. 

Theorem 1 Every pair of complemented subspaces can be carried simultane¬ 
ously into 


+(“))，- Ho ：： 


by a transformation of the form (8). 

Proof We may suppose that one of the subspaces is 
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that the common point is 0. Let U be a point in the other subspace different from 0. 
Then, since r(0, U) — 1, we have 


The transformation 


^(rrHrry 

I ( 0, [ I I onto itself and carries U into 

v vo °// 


The space spanned by 0 and [ ^ |isn[o,( 。 )1， which provesthe theo- 

V 0 u / V V 0 1 // 

rem. 

Consider a point Z, outside a pair of complemented subspaces III and II 2 , whose 
arithmetic distance from the common point of IIi and Il 2 is 2. Suppose that ^ is so 
chosen that there is a point Pi of Ill satisfying 


and that for each point Q of there exists a p>oint P 2 of II 2 such that r{P^,Q) 

=1. As Q runs over all elements of the set of all points in the subspaces 

Q) form what we shall call a reducible subspace S. IIi and II 2 will be called the 
components of S. 

Theorem 2 Every reducible subspace can be carried by a transformation of 
(8) into the set of matrices 


where x and z run over all elements of the field. 
To prove the theorem, we may assume that 


•242 - 


华罗庚文集代数卷 II 


Then 

z= (: :)’ 。+ 

To ensure the existence of P \, we have to assume that c ^ 0 in which case 

Pi = 

Then II(Pi, Z) is the set of points: 


a — b 2 /c 0 

0 0 t 


Q = (i-t) 


a — b^/c 0 


H: 


where t runs over all elements of 

The existence of P 2 in II 2 asserts that 
6 2 


(1 —t)^a- + t + a^O, 


c c 

for all t. Consequently b = 0, ac^O and 


P2 = 


0 0 
0 tc , 


Now Q takes the form 


and the space spanned by Q and P 2 consists of the points 

f sa 0 \ 
l 0 tc )' 


This proves the theorem. 

3. Construction of involutions 

We begin by de&ning an involution. Let be a reducible subspace and Z a point 
not on S. Let A be the set of elements X on 17 for which 


r(X,Z)^l. 



r{X,Z) < 1 


for all X belonging to A. This defines Z for every Z not on 17. If 2 is on E, let Z = Z. 
The mapping Z —»Z is called an involution of the first kind. 

If S has the form (10) and 




(a — ®)(c — z) — 6 2 = 0. 

That is, A is the set of elements satisfying (11). Let 


then (11) implies 

(a — x)(c — z) — b 2 =0. 

Consequently a = a, c = c and b 2 = 6 2 . Then b = —b. 

Analytically, the mapping can therefore be expressed in the form 


Theorem 3 The mapping Z —* Z is an involution of the first kind if and only 
Z = AZA! + 5, A 2 = I, ASA' + S = 0. (13) 


Z = BWB' +T 


Z = BWB' + T. 
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W=B~ l l 


1 0 

0 -1 


+B 'i{i - 1 ) t {i 二 )- 十 ' 


(14) 


which is evidently of the form (13). 

Conversely, since A 2 — /, we can find B such that 


1 0 

0 -1 


(or ±1, but these 
S 


BAB~ 

s give only the identity mapping). Further 




where 

T = -^BSB'. 

That is, (13) can be expressed as (14), which proves the theorem. 

Theorem 3 shows that an involution of the first kind is uniquely determined by 
a reducible subspace and that every point of this reducible space is invariant under 
the involution. 

Two different involutions of the first kind are said to be commutative if one 
carries the reducible subspace of the other onto itself. 

Theorem 4 Any pair of commutative involutions of the first kind can be car¬ 
ried simultaneously into (12) and 


Z 


0 




(15) 


Proof Let one of the involutions be (12). The other carries the reducible 
subspace 



onto itself. Let 


A 


a\ 

03 


02 

04 


S 


s\ 

S2 




Since A is nonsingular, we have either 


The second case cannot happen, as otherwise the second involution coincides with 
the first. Thus we have ai = (14 = 0, and 


0 b 、 
1/6 0 , 

l z ( 

f 0 b 
、 1/6 0 


( -sb 2 0 
\ 0 s 

(：：) 

\Wi 

(：：) 


(-b 2 a 0 \ 

" 0 J 

(。) 

1 W : 

l::] 


( - 内 0 、 
{ 0 0 ) 


we have (12) and (15) with W instead of Z. 

The product of two commutative involutions of the first kind is called an invo¬ 
lution of the second kind. Therefore an involution of the second kind can be carried 

int0 / x / \ 

^ f 0 -1 \ 


oH^o ： 


The fixed elements of this transformation are of the form 

1 0 \ 


The manifold of the fixed elements of an involution of the second kind is called a 
chain. 
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We are now going to examine all the chains in a reducible subspace. Notice that 
(16) represents all the involutions of the first kind which are commutative with (12). 
The product of (12) and (16) is 


~Z = 



0 

s 


The chain in the reducible space 


x 

0 



is therefore of the form 


b\t-a) 

0 



where t runs over all elements of We put in more parameters to make it similar to 
a straight line. The chain is also equivalent to 


/ ot + b 0 
y 0 ct + d 


(19) 


if a = cx 2 is soluble. 

In the next section, we shall obtain harmonic separation by means of the con¬ 
struction of complete quadrateral. But we should notice that we have neither the fact 
that passing through any two points there is a “line” nor the fact that any two “lines” 
have a point of intersection. 

The chains passing through a point (: ^ ^ are of the form (19). The chain 
passing through two points 


a 

0 



0 

d 


is of the form 



if it exists. The condition for existence is that 


( 20 ) 


(a _ c)/(b - d) 


( 21 ) 


is a square element in 


4. Proof of the theorem for ra = 2 


Theorem 5 Every mapping carrying symmetric matrices into symmetric ma¬ 
trices and keeping the rank of differences invariant is a combination of a homothetic 
transformation 

Z\ = aZ, 

a translation 

Zi = Z + S, 5’ = 5 ， 

an automorphism of the field 

Zi = Z° 

and a congruence relation 

Zi = AZA', det ( 乂 ） / 0. 

This is practically the theorem stated in the introduction. In this section, we 
shall give a proof for n = 2. 

Notice that the notion of reduced subspaces and chains depends only on the 
notion of arithmetic distance. 

Let T(Z) be a mapping of the type under consideration. It is evidently one to 
one. Without loss of generality, we may assume that it carries the reducible subspace 

/ x 0 
V 0 « 

/ 1 0 

onto itself and the chain t 

1 

r ⑼= 0 ， r(/) = /. 


) onto itself, and moreover, that 


T(al) = a°I. 

We shall prove that ct is an automorphism of the field 
1) (a + b) a =a a + b a . 

Consider only the chains in the reducible subspace 
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Any diain, different from tl, passing through 0 can be expressed a 


Any chain which does not intersect tl can be expressed as 


The intersection of (22) and (23) is given by 


The chain passing through al and (24) is of the form 


+ T^?(o P °0 (1_<) ' 

{ a -T^)/{ a ~Yrp) 


is a square element in 

FVora bl, we draw a chain which does not intersect (22); it has the form 


The intersection of (27) and (23) is given by 


T^(o 


FVom (28), we draw a chain which does not intersect (25); it has the form 

: 0 占 )* + ☆(: : 2 ) +w ’* 64> . (29) 

The existence of (29) is assured by (26). This chain intersects tl at the point (a+b)I. 
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operation T. The existence of p and q satisfying (26) is assured since (26) is linear in 
q. Therefore we have 

T(al + bl) = r(a/) + r(w). 


2) (ab) a = a°b a . 

From Q we draw a chain 


(: 



t 6 


From /, we draw another chain 


(I 



tei. 


Their intersection is given by 


q 2 ~l / 1 0 \ 
9 2 - P 2 \ 0 p 2 J 


(30) 


(31) 


(32) 


Connecting (32) and bl, we have 





t € 办， 


(33) 


provided that 




(34) 


is a square element r 2 . 

FVom al we draw a chain not intersecting (31 )， 


1 0 
0 . 


(35) 


which intersects (30) at 

q 2 ~l „ 广 1 0 、 

9^ U P 2 人 

From (36), we draw a chain not intersecting (33), 

p0 _u ) + ^°(o pO 


(36) 


(37) 
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This intersects the ch^n tl at abl. The existence of (37) is assured by (34). 
Therefore we have 

(ab)° = a a b a , 

provided that we can justify that (34) is always soluble in p, q, r. That is, for 
we need p, q, r in the field such that 

L q 2 — 1 1 — p 2 r 2 
b= q 3 -p 2 1-r 2 ' 

For any b, there exist two elements u and v of the field such that 


b = uv, —2uv + u + v ^0. 
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We may assume that 


for otherwise, we can use 






0 

0 


lies on the subspace II 



we have 



for all a: € $■ Similarly, we have 



Since 


but 


we have inmediately 


Now we let 


we consider these 


x 

0 
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that is 

(a — i)(c — z) — b 2 = 0. 

By (41) and (42)，this equation implies 

(a •- x)(c* - z) - 6* 2 = 0 
and vice versa. Thus we have 

a = a m , c = c*, b = ±b* • 



we may assume that 



Suppose that there exists 6^0, such that 



By (43), we should then have 

(x — a)(z — c) — (1 — 6) 2 = 0, 

which implies 

(i — a)(z — c) — (1 + b) 2 — 0. 

This is impossible, since the characteristic of the field is different from 2. Therefore 



and the theorem is proved for n = 2. 

Remark The theorem seems still to be true for the field with characteristic 2. 
For the prime field with characteristic 2, the proof is quite simple. The space contains 
only eight points 
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We may assume that 




0 

0 


The only singular matrix which has not been taken into consideration is | 
Therefore 


：：) = (：：)' 

aent satisfying 

o, (“ ))m= 


I is the only element satisfying 


we have 



Finally, we have 


5. Subspaces 




In order to extend our notion of subspaces to higher dimensions, we introduce 
the concept of a dieder manifold. 

Definition Let P and 。 be two symmetric matrices of order n. The symmetric 
matrices X satisfying 

r(P,X) + r(X,Q) = r(P,Q) (44) 

form a manifold which is called a dieder manifold spanned by the points P and Q. 
The arithmetic distance between P and Q is called the extent of the dieder manifold. 
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Since the pairs of points having a fixed arithmetic distance form a transitive 
set®, the dieder manifolds of the same extent form a transitive set. 

Theorem 6 If p < n, the points on the dieder manifold spanned by 


0, 


J(p) o 

0 0 


are of the form 


f 0 \ 

k 0 0 人 

Proof Let ^ be a point on the dieder manifold and 

X 12 ' 

、 X } 2 x 22 , 


x = 


X l2 =X^ n - p \ X 22 = X^~ P) - 


(45) 


p = r(P,X) + r(X,Q)> r(0,X n ) + r(X n ,I M )> r(0,/^)=p, 

we have 

r(0,X u ) + r(A ： n ,/(» , >)=p. (46) 

There is a nonsingular p-rowed matrix T such that rXiiT -1 is of the normal 
form. In virtue of (46), we find that the normal form becomes 


rA-nr- 1 = 


' I M 0 

0 0, 


0 < g < p. 


(47) 


We shall now prove that X 12 — 0, X 22 = 0. Let 


be a (p + l)-rowed principal minor of X; it is sufficient to prove that v = 0, o = 0. 
From (44), we deduce in the same way used to establish (46), that 

p = r(p,X) + r(X,Q)^ r(0^\X o ) + r(x 0 ,^ 1 ^ ^ > P- 

Consequently, we have 


r(0^ +1 >,X 0 ) = r(0^,X n ) = <7 


(48) 


① I， Theorem 2. 



is of rank g, we deduce immediately that -ua = 0, tua = 0, and a — uitwj = 0. 

Using (49), we see also that 圯 = 0, tin = 0 and a - U 2 v/ 2 = 0. Therefore v = 0 
and a = 0. This establishes our theorem. 

Definition Let p < n. A set of points is said to form a normal subspace of 
rank p, if every pair of points of the set has arithmetic distance $ p, if it contains two 
points of arithmetic distance p, and if it contains all dieder manifolds spanned by any 
two points of the set. 

Theorem 7 Normal subspaces of the same rank form a transitive set; more 
precisely, every normal subspace of rank p can be transformed, by (8) into the normal 
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6. Proof of Theorem 5 in general 

We now suppose that n > 3. Let 

r(^) = Zi 

be the mapping under consideration. Suppose that 

T(0) = 0. 


The points of the form 

{V o) 

form a normal subspace of rank n — 1. Since the arithmetic distance is invariant, the 
set of points 

r (W-i) o) 

constitutes also a normal subspace of rank n— 1. Since the set of all normal subspaces 
forms a transitive set (Theorem 7), we may suppose that 

。 = k 0 。 人 

Thus r induces a mapping on (n — l)-rowed symmetric matrices, and it keeps arith¬ 
metic distance invariant. By the hypothesis of induction, we have 

W = aaW^ol + n, Q = a (n - 1) , 〆 ="= 

Then the mapping 

Z = aAZ^A' + S, 


where 



carries the mapping f(Z) into a new one with 


V 0 


0 \ ( W( n-1 ) 0 

oJ = l 0 1 


(51) 


Since 


0 < n_1 ) 

0 


0 
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of rank 2. The mapping T induces an automorphic mapping on two-rowed symmetric 


,(0) = 0, 七 ：) = (::),；) = (" °) 

Since the theorem is true for n = 2, we have, by the first equation of (54), 


where a is either 1 or a nonsquare element of From the second and third equations 
of (54), we deduce 

.f b 0\ 


for all x. If in particular x = 1, we see that a and e must be square elements, 
whence a = e = 1, and 6 2 = c 2 = 1. Consequently x a — x, that is, a is the 
identity automorphism. Substantially, we have only two cases: (1) 6 = c = 1 and (2) 


The mapping Y leaves 


/(r»-2) o 

0 1 ⑴ 


0 ⑴ 0 

0 o (n 一 2) 
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0 

0 w^ n - ] 


二 H 


0 0 
0 Wb- 1 ) 


since two points of (56) span the normal subspace in the bracket. We have by the 
hypothesis of induction that 

W = aAVT/^ + S, A = A^-^, S = = S', 

where a is either 1 or a non-square element. By (51) and (55), it leaves 


W = 


' x( n - 2 ) 0 ' 
0 0 , 


pointwise invariant, whence 5 = 0, a = l,r 
= I or 

For the former case we have 

r 

For the latter case 


0 0 
0 


H 


Zi = 


0 

0 -1 


■ 0("-2> 0 、 
0 X 


'/( n ~ 2 ) 0 
0 -1 


0 0 
0 


’ J( n -” o 
0 -1 


carries T into a new transformation which satisfies (51), (55) and (57). 
As in III, p. 238, we deduce 

r(( 〜 ) ） =(40 ， Zii = 4. z m = ±2ln 


(57) 


% = for (*.i) ^ (!.«)■ 

Suppose that there exists a Z = (zy) such that 
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•Zll — Zln . • • ^l,n-l —之 1j> 0 

、 0 • • ■ Zn-l,n — Zin Z nn — 2ln , 

(Zu — Zi n • • • Zl,n -1 — Zi n O' 


\ 0 … -2n-l,n 一 ^ln 之 nn _ 2ln / 

and since arithmetic distance is invariant, we see that the matrices 






f Z\n . 

• 2ln 

-Zln ^ 

fT' 

• «ln 

«1»» 、 

and 

2ln 

- 2ln 

*ln 

\ 2ln • 

- 之 In 

名 In / 


、 — 2ln . 

• *ln 

2ln ; 


have the same rank. This is a contradiction, since the former is of rank 1 and the 
latter is of rank 3. Thus z{ n = Zi„ . That is, 

T{Z) = Z. 

The theorem follows. 

7. Proof of the fundamental theorem of the projective geometry of 
symmetric matrices 

A point (X, Y), in homogeneous coordinate, is called finite if y is nonsingular, 
otherwise, it is called infinite. The patricular infinite point (/,0) is denoted by oo. 
Evidently a necessary and sufficient condition for a point P to be finite is that 
r(P, <x>) = n. 


Since the projective space of symmetric matrices is transitive, we may assume 
that the automorphic mapping of the space carries oo into itself. If it keeps arithmetic 
distance invariant, the mapping carries finite points into finite points and keeps the 
arithmetic distance of finite points invariant. By Theorem 5, we have 
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8. Remarks 


1°. The condition stated in the theorem can be somewhat weakened since the 
invariance of the arithmetic distance 1 implies the invariance of arithmetic distances > 
1. In fact we have 

Theorem 9 Two points P and Q are of arithmetic distance p if and only if 
there exists p —1 points X\, …， X p -i such that 

r(P,A-i) = r(X u X 2 ) = … =r(X p _ 2 ，: VO = = 1， (58) 


p being the least integer with this property. 

Proof 1) If P and Q have the arithmetic distance p, we may assume without 
loss of generality that they are 


0 , 



0 

0 


respectively. Then, 



? = 1,2, •■- ,p- 1 


are the points required. 

2) By the triangle inequality for arithmetic distances, we have 


r(P,Q) ^ r(/?, Xi) + r(Xi,Xi) + • • • + r(X p _i ,Q) < p. 


If r(P,Q) < p, then we should have fewer X’b such that (58) holds. Therefore 
r{P, Q) = P- 

From this we deduce immediately the following sharper result: 

Theorem 10 Let E be the set of all symmetric matrices of order n 彡 2 over 
a field $ with characteristic ^ 2. Any one to one mapping of E onto itself carrying 
a pair of symmetric matrices with arithmetic distance one into a pair with the same 
property is of the form 

Zi = aAZ°A! + S, S = S’, 
where o G $ and a is an automorphism of 

2°. The previous theorem seems to remain true for a field with characteristic 2. 

3°. The method for the construction of involutions has not been used to its full 
capacity. In fact, it is very likely that we can construct the whole symplectic group 
by means of this procedure. 



On the multiplicative group of a field* 

Theroem 1 The multiplicative group of a field is either abelian or not 
metabelian. 

Let K be the field under consideration and let Z be its center. If K is identical 
to Z, then the multiplicative group Kq is abelian. Now we are going to prove that if 
Z is & proper subfield of K, then the multiplicative group Kq is not metabelian. By 
the words “not metabelian”，we mean that the derived series of Kq, 

Ko D K.\ D If 2 D • • • D K r D • • • 

never ends at identity, where K r is the commutator subgroup of K r -i. We use 
(a, 6) = a -1 6 -1 a6 

to denote the commutator of a and b, then K r is generated by elements of the form 
(a, 6), where a and b run over all the elements of K r -i- 
To prove the theorem we require several lemmas: 

Lemma 1 Let A = C(x, y) be a field obtained from its center C by the ad¬ 
junction of two elements x and y satisfying 

xy = ryx, 

where r 1) belongs to the center. A necessary and sufficient condition for x being 
algebraic with respect to C is that r is a root of unity. 

Proof 1) Suppose that r is a q-th root of unity (q > 1). Then x q commutes 
with y, therefore it commutes with every element of A. It belongs to C. Then x is 
algebraic. 

2) Suppose x is algebraic, say the equation 

f{x) = akx k + ••• + a 0 = 0 (1) 

has the least degree fc. Transforming (1) by y, we have 

y~ l f{x)y = akr k x k + ■ • • + airx + ao = 0. (2) 

* Received June 7, 1950. 

Reprinted from Science Record, 1950, 3(1): 1—6. 
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Substracting (1) from (2) and dividing by x, we have a polynomial of degree 
k unless o/,(r fe — 1) = 0 for 1 < /i < A;. Since ak ^ 0, we have r* = 1. The ; 
therefore proved. 

Lemma 2 Let S bea field processing two elements x andy satisfying xy = ryx, 
where r belongs to the center of S. We define 

xi = (y, 1 - *) = y _1 (l - a ： ) -1 y(l - ar) = (1 - rx)~ x {\ - x) 

and xi = (j/yXi-i). Then 

xi = xi(x) — JJ(1 — 

(=0 

Proof The lemma is evidently true for / = 1. By the hypothesis of induction, 
we have 

xi = y~ l x{2i(x)yxi-i{x) = 

<=o 

Thus the lemma is proved. 

Proof of the theorem In a previous paper the author ! 1 】 proved that K\ is 
not contained in the center Z. We suppose that K r -\ is not contained in the center 
Z, but K r is contained in the center Z. 

Let 

ai> 02, ••• 

be a representative system of K r -i by K T . By a theorem of the author ⑵， if r -i 
generates the field K, thus we have 

K = Z(aj,o 2 , -). 

where a< belongs to K r -i and (oj,o 7 ) belongs to K r , then to Z. 

Since K is not commutative, we have x = an and y = aj such that 

(x,y) = r, 

which belongs to Z but r ^ 1. 


t=l t=0 
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We define ij as in Lemma 2. Then for / > r, xi belongs to K r , since y and x a 
belongs to K r 一 i and K a respectively. 

1) r is not a root of unity. By Lemma 1, x is transcendental with respect to Z. 
Then, for i ^ r 

xj = I"I(1 - rt：c )( _1 )*(【). 

t=o 

cannot belong to Z. For otherwise, from 

xi H (1 - r 2t+1 z) (a *+> ) = (1 - 

0^2t+l$< 0<2t 矣 t 

we deduce that xi = l and 

E (« + i)r 2t+， = E ( 扑 24 ⑻ 

by comparing the constant term and the coefficient of x. (3) can be written as 

(1-^ =亡(-1邝)一 = 0， 

t=o 

which is impossible. 

2) r is a (f-th primitive root of unity. Let us consider the field A = Z{x,y). Let 
C be the center of A, then, evidently we have A = C{x, y). If C contains only a finite 
number of elements, then A is a finite field, and consequently it is commutative. This 
is a contradiction. Now we assume that C contains infinitely many elements. 

Since x q is commutative with y, it belongs to C, say 

x 9 = a. 

This is a polynomial of the lowest degtee satisfied by x, for otherwise from 


f{x)~ y~ l f{x)y = f(x) - f{rx) = 0, 


we deduce that the order of r is less than q. 
As a runs through C, 


xi(as) = JJ(1 - r 4 aa:)( _1) * (lt ) = JJ(1 — 
t=o t=o 


where 


入 *= E (- 1 ) t+9S (m«)- 
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For / ^ r, xi(ax) belongs to C for all a. Notice that 
9-1 

J2(l — r l ax) = 1 — (aa;) 9 = 1 — aa v 
t=o 

belongs to the center. Let 

—N = min At. 

00^9-1 

9—1 

Since At = 0, N is a positive integer. Write 
t=o 

g-l 

xi(ax)(l — aa q ) N = J"J(1 _ rtax ) Xt+N 
1=0 

=Ao(a) + A\(d)x + • • • + ^4 4 -i(o)x ,_1 , ⑷ 

where the coefficient of a in Ai(a) is equal to 

+ = 5Z A * r< = 51 ( -r ) m ( 1 ) = (1 - ^) 1 T 4 0. (5) 

t=0 t=0 0^m<( \ 171 / 

Since the left hand side of (4) belongs to C and does not satisfy any polynomial of 
degree less than q, we deduce 

山 ⑷ = 0 ， 

for all a belonging to C. Since C contains infinitely many elements, the coefficient of 
a in 七⑷ must be zero. This contradicts (5). The theorem is completely proved. 

As corollaries, we have the following theorems: 

Theorem 2 Let K r be the r-th derived group of the multiplicative group of a 
not commutative field. Then the field is generated by K r - 
Since 

o -1 ft -1 ai = a~ 2 (ab~ l ) 2 b 2 , 


we have 

Theorem 3 Every field which is not commutative has no proper subfield con¬ 
taining all the squares of the r-th commutator. 

This theorem was proved previously by the author【 3 I for r = 1. 
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Remark 1 Theorem 2 can be easily extended to all sorts of commutator 
subgroups. 

In fact, let us define by induction a form of r steps by (o, b) where the greater of 
the steps of a and b is equal to r — 1. We fix a form of r steps, and let each alphabet 
involved run over the elements of the group K 。， then we obtain a system of elements 
L. The group G generated by all the elements of i is called a commutator subgroup 
of step r. Then the field K, not commutative, is generated by G. For the set L 
contains all r-th commutators, this can be proved easily by induction. 

Remark 2 The corresponding theorem about the multiplicative group of in¬ 
vertible elements of a simple ring with descending chain condition is known. In fact, 
we can prove that K\ = K-i except the 2-rowed total matrix algebraic over field of 
two or three elements. 

References 

[1] Hua L K. On the automorphisms of classical group. Monographs of Amer. Math. Soc., 
1950. 

[2] - • Some properties of a field. Proc. of Nat. Acad, of Sciences, 1949, 35: 

533-537, Theorem 1. 

ibid, Theorem 6. 


[3] 






环之准同构及对射影几何的一应用’ 

于一九四九年著者⑴曾解决域之广义同构 ( semi-automorphism of a field ) 
问题.关于此问题之历史，可参阅 Reidemeister ^, WachsPl , Ancochea ' 4 -®', 
KaplaMkyW . 本文将简化原证,并将其推展至无零因子之环 (ring without zero di - 
vteor ), 然后再证明所谓“射影几何之基本定理”.前仅 Ancochea ^ 证得一特例，该 
例中限定四元域 ( quaternion ); 对任何域,既未证明该条件为充分,亦未能证其为必 
要.本文将证明此为一必要且充分之条件. 

定义命 ft 及祀为二环，且设 /£' 无零因子，其间有一对应之关系：对丑 
之一元素 a , 圮中有一元素 f 对应之.若此对应适合下列之性质，则谓之准同构 
(homomorphism), 

(o + fc) a » (1) 

(«i6r = a^. (2) 

如换 （2) 为 

« = ⑶ 

则该对应谓之反准同构 （ anti - homomorphism ); 若换（ 2 )为 

(«Y = {a°) z ⑷ 

及 

( obo) ff = a a b°a°, (5) 

则谓之广义准同构 ( semi - homoroorphism ). 显然准同构及反准同构皆为广义准同构. 
但其逆如何？换言之，是否有异于准同构及反准同构之广义准同构存在？此为—前 
所未决之问通,今解决之 如下： 

定理一广义准同构必为准同构或反准间构 • 

证明由 （5) 及⑴可得 


(abc + daaY = [(o + c)b(a + c) — aba — cbc] l> 

=(^+ c ff )6 ff (a <r + C")- a a b a a a - tfb°c 0 
=a°b 0 <f + ifb a a 0 . (6) 


* 1950 年 4 月收发表予 （:中 国科学 》, V 950. 1 ⑴： 1-6. 
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由⑷及⑹可得 


[(oA)* 7 - aTb^abY - b c a a ] 

=[(o6 广] a + a 0 ^ 0 ) 2 ^ - [a°b a {aby + {abYb 0 a a ] 
=[(a6) 2 + ab 2 a — ab(ab) — (aijba]* 7 = 0- 

因 R ' 无零因子，故应得 

若有一对元素 a 及使 


(oby 


a a b° SR 


吾往证，对任一元素 c 常有 
由此式不真，则由⑺可知 


{ab) a = a a b° ^ b e a a , 


(cb)° = c°b°. 


(cb) a = \fc° ♦ c a b a . 


再由⑺及⑻， （10) 可知 

a a b a + b a c a = (aby + (c6) ,r = [(a + c)b] a = 


K + f ) 俨或 
浐 ( a ff + (f). 


⑻ 

⑼ 


( 10 ) 


此式或与 （8) 矛盾或与 （10) 矛盾，是以 （10) 不能成立，故得 ⑼式. 同法可证，对 
任一元素 d 吾人常有 

(ad)° =a° dT. (11) 

若能证明对任二元素 c 及 d 常有 


(cd) a = d 

则定理已明.今假定有一对 c , d 使 

(cd) a = (Tc a ^ c°dF, (12) 

由上述之法泡制，由 （12) 可得 


(ady = dr a°. 


{cbY = b°c°. 


( 13 ) 
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引用 （7), ⑻，⑼，（11)， （12) 及（13)， 




此二式皆不可能，定理于是证明. 

若 R 及 R ' 为域，则准同构必为同构 （ isomorphism ), 因之得 
定理二凡域之广义同构必为同构或反同构,凡域之广义自同构,必为一自同 
构或反自同构. 

附记一 假定及纪各有一单位元素1及 I ', 若1〃 = 1'，则可由⑻以 
得⑷. 

附记二若以 

( oi » + 6 a) ff = a a b ° + b ° a a (14) 

代（2)，则该对应谓之 Jordan 准同构.假定环允有如次之 性质： 由 2 a = 0可得 
0 = 0,如是则 Jordan 准同构与广义准同构同.其证明如 下：由 （4) 及 （1) 立得 


(o6 + 6o) <r = [(o + 6) 2 -a 2 -6 2 r = K+0 2 - (a 17 ) 2 - (b °) 2 
= 0^ + 6^. 

此即 (14). 反之，于⑴及 （14) 中命6 = a 各得 

(20)^ = 2 a°, (2a 2 ) a = 2(a a ) 2 , 

故得⑷式•再由 (14), (1) 及⑷可知 

2 (aba)° = (2060)^ = |(ai» + ba)a + a(ab + ba) — (a 2 i» + 6a 2 )】 ff 

= [(a^b 17 + b°a a )a v + + b a a°) - (a^b 0 + b a a a2 )\ 

= 2 o ff 6 <T a CT . 


此即 （5) 式. 

兹应用定理二以解决非交换域之一度射影几何之基本定理. 

设 fc 为一域（并不限定 ai > = 6 a )， 其特征非二.以 it 中之任一非全零之元素对 
( Zl , S 2) 表一点.若 A ： 中有一元素 /l 使 J/l = ZlXl ， J /2 =以2,则此二元素对谓之表同 
—点.特如3：2 / 0,则巧）所表之点 P 亦可表为 （ arfa ^ l ) .此 x ^ xi 名为该 
点 P 之非齐次坐标. x 2 =0 之点谓之无 穷点. 显然凡非无穷点必对应于 A : 中之一 
元素.反之，凡 * 中之一元素，必有一非无穷点以之为其非齐次坐标，所有的点成一 
个一度射影空间 7 T . 
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若数阵 

(a b 
y c d 

有可逆性，设其逆由次式决定 



则变形 

z ' = (az + b)(cz + d ) -1 (16) 

变 7 T 为其自己.此种变换成一种 一一 对应.由 （15) 可得恒等式 
{—zcf + a’)(az + 6) = [zJ — b'){cz + d), 

故 （16) 亦可写为 

z ' = + a ')-\2 d ! - b '). (17) 

更普 遍些： 若 o 为 k 域之自同构或反自同构，则可导入变形 

z ' ^( az ^ + b )( cz a + C 1)- 1 . (18) 

此名为广义射影变形.显然，此亦建立: r 之一自己的一一对应. 

如有四点句， Z 2 , 2 3 , Z 4 适合如下之关系，则名为调和点列， 

(Z2 - Z 4 )~ 1 {Z2 - 23)(么1 一 z 3 )~ l (zi -24) = -1- (!9) 


关系 （19) 名为调和关系. 

定理三凡广义射影变形变调和点列为调和点列.反之，凡使调和点列仍变为 
调和点列之变形必为一广义射影变形. 

命 Zi ' 及2/各为及 Zj 之对应点.由 （16) 及 （17) 及关系 （15) 立得 

Zi ' - Zj' =( 一為 〆 + a ， r 1 (2id , -6 / )-(«2j + b){czj + d)- 1 

={-ZiC 1 + - b ')( cz i +d)~ (-2.^ + a')(azj + b)](czj + d) _1 

=(-2〆 + - Zj)(czj + d) _1 . 

代入 （19), 可见 V ，2 2 '， Z 3 W , 仍成一调和点列. 

更由恒等式 

a _1 (a — 6)6 -1 = b _1 (a — 6)a _1 
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可得 

( Z 2 -名4〉 _1 (勿 — 之3)(之3 - «4) _1 = (Z3 - Z i )~ 1 {z 2 - 2 3 )(2：2 -方4广 S 

及 

(*1 - *4) _1 (*1 -勿 )(*3 - 2 4 )' 1 = (Z 3 - Z A )~ l (Zi - 2 3 )(2i - 24) — 1 . 

故推得 

(22 - 2 4 ) _1 (勿一 2 3 )(«1 - «3) _1 (zi - « 4 ) 

=(*3 - *4) _1 [(*2 - *3)(^2 - «4) _1 (*1 - *4)(*1 一 23)_ 1 】(芍-為). 

由 （19) 可知 

(22 - 2 3 )( Z 2 - Z A )~ 1 {ZI - Z A )( Z X - * 3 ) _1 = -1, 

即得 

(«1 — 24)(21 — Z3 )~ 1 ( Z2 — Z3 )( Z2 — Z4) _1 = —1. (20) 

显然 （19) 经 ifc 之自同构不变.由 （20) 可知调和关系,经反自同构亦不变. 

定理三之逆部分乃次定理之 引申： 

定理四凡一域 fc 之自己一一对应,且使调和关系不变,此对应必为如下之形 
态： 

z ’ = qz a r + s , (21) 

此处 g 及 r 为 fc 中非零之元素， a 为 A :之自同构或反自同构. 

因 （ 21 ) 可变任何二元素为 0 及 1 ,故仅需讨论对应之使 0 及 1 不变者.命 
Z - 2 * 7 为此种对应，即 I* 7 = l ，0 ff = 0.由 （19) 式解出2 4 ,则 

Z4 = Zi + {Zl - 2 2 )(Zi +Z 2 - 2z 3 )~ 1 (zi - Z 3 ). 

若勿 =+ Z2 ), 则 24 不能存在，故可知 

4= [卜… 2)] ° = \{zl +4). (22) 

命 24 = 0 ,则 

4 = ( 22i(zi + z 2 )- l z 2 y = 2zf ( 2f + 4) _1 4 - (23) 

由 （22) 中命= 2® 及勿= 2 y ， 则得 


2(x + yr = (2xr + (2yr. 


( 24 ) 
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命 y = 0, 则此〃 =( 2 ；c 广 • 由 （ M ) 可得 

(x + y)° =x a + y°. 
于 （ 23 ) 中命 zi = a, Z 2 — 1-a, J0!) 


(25) 


S (25 ) 可知 
由此式立得 


[2a(l - a)Y = 2a <T (l - a°). 


(a 2 r=(a ff ) 2 - 


{xy + yx)° = [(x + y) 2 - 1 2 - y 2 )° = x a y a 4- y a x a . 
由附记二及定理二，故得定理 . 
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A theorem on matrices over a sfield and its 
applications* 


1 . Introduction 

Let $ be a sfield (or a division ring). Let m and n be two positive integers 
such that n < m. We use M = M( n _ m ) to denote an n x m matrix over $ and 
Af ( n ) = M^ n,n ^ and M' to denote the transposed matrix of M. 

Two n x m matrices Z suid W are said to be coherent, if the rank of their 
difference Z — IV is one. 

The aim of part I of the paper is to establish the following theorem: 

Theorem 1 Suppose 1 < n ^ m. Any one-to-one mapping which carries 
fi x m matrices into nxm matrices and leaves the coherence invariant is of the form 


Zi=PZ°Q + R, (1) 

where P{= P^) and Q(= Q( m )) are non-singular, R is an n x m matrix and a is 
an automorphism of the sfield When n = m, in addition to ( 1 ), we have also 

Z^ = PZ ta Q + R, (2) 


where a is an anti-automorphism of the sfield. 

This asserts that the equivalence relation used in the theory of matrices has a 
unique position. 

In a previous paperl 1 】 the author established a corressponding theorem for sym¬ 
metric matrices by the construction of involutions. The method to be used in the 
present investination is entirely different. The main idea is based upon the method 
of “meet and join". It suggests a possibility for the formulation of this geometry in 
terms of lattice theory. 

In case m = n = 1 , the problem is closely related to tlie fundamental theorem 
of projective geometry over a sfield, which has been discussed, though very incom¬ 
pletely, by Ancochea^. In a short note the author 阆 proved the fundamental theorem 
without restriction. There seems to have no essential difficulty to build up 压 theory of 
projective geometry over a simple ring with chain condition, but now instead of the 


* Academia Sinica. Reprinted from Chinese Mathematical Society, 1951, 1(2): 109-163. 
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harmonic relation, the weaker concept of coherence plays an important role. Later 
we shall treat the geometry of rectangular matrices which is practically more general 
than this. 

In part II and part III, we shall give several comparatively more direct applica¬ 
tions. 

It is well-known that every automorphism of a simple algebra is an inner one 
(see, e.g. Albert [ 2 ], p. 51 ). It was extended by Ancochea【 3 ] and KaplanskyW to the 
so-called semi-automorphisms. As an easy consequence of our present theorem with 
m = n(> 1), we solve the problem about semi-automorphisms of a simple ring with 
descending chain condition for the left ideals. 

It can be readily seen that a semi-automorphism is just a Jordan automorphism, 
provided that the charactaristic of the sfield is different from 2 . Incidentally we solve 
the problem of Jordan isomorphism for simple ring with chain condition. How is 
the analogous problem for Lie isomorphism? The answer is not so perfect as the 
corresponding result for Jordan isomorphism. The author can only establish the 
result for sfield with characteristic different from 2 and 3 . Nevertheless, in case the 
sfield is commutative our result is better that the known ones (Jacobson^). 

Another application is to the Grassmann geometry, namely: 

Let us consider the space formed by (n — 1 )-dimensional linear manifolds of the 
(m+n -1 )-dimensional projective space. Two different (n — l)-dimensional manifolds 
are said to be coherent, if both of them are contained in an n-dimensional linear 
manifold. By means of Theorem 1 , we can obtain all the one-to-one transformations 
keeping coherence invariant. 

More precisely, let 

{Zil, • • • , Zi tm +n)t 1 ^ i ^ n 

be n points in the (m + n — l)-dimensional space. Let the matrix 
^ = ( 之 <i)l<i<n ， lC<m+n 

denote the (n — 1 )-dimensional manifold spanned by these n points, where Z is as¬ 
sumed of rank n. 

Two nx (m+n) matrices Z and Zi, both of rank n，represent the same manifold 
if and only if there is a matrix Q = naturally nonsingular, such that 

Z 1 = QZ. ( 3 ) 

We shall study the space formed by all the (n — l)-dimensional linear manifolds. 
The space admits evidently the mapping 
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Zi = QZ a T, ⑷ 

where Q — 卩⑷ and T — r< m+n ) are both nonsingular. 

Analytically, two (n — l)-dimensionaI linear manifolds W and Z are coherent, if 
and only if the rank of the 2 n x (m + n)-matrix 

U) - 

is equal to n + 1 . 

It will be established as a consequence of Theorem 1 that any one-to-one trans¬ 
formation which carries the space formed by all (n — 1 )-dimensional manifolds onto 
itself and leaves the coherence invariant is of the form ( 4 ). However, in case m = n, 
besides ( 4 ), it admits another type of transformations which will be described in the 
text. 


Part I. Algebraic Part 

2 . Affine geometry 

In this section we shall assume that n = 1 . 

Theorem 1 fails for n = 1 , because the coherence is now merely a condition 
which asserts that the correspondence is one-to-one. For the sake of completeness, we 
give here the following corresponding theorem with an additional condition. Now we 
suppose m > 1 . 

A line (or left line) is defined by those points given by 

- - ,Orn) + ( 6 i, ••- ,b m ), ⑹ 

where the a's and the 6 's are fixed and ( runs over all the elements of the sfteld 
Two lines are called intersecting if they have a point in common. This is called the 
coherence relation betwen lines. 

Evidently, passing througth two points (xi, ••- , x m ) and (yi, • • • , y m ), we have 
a unique line 

- ,x m ) + t(yi, ■■- ,y m ). (7) 

Theorem 2 Let m > 1 . Every one-to-one mapping of the m-dimensional 
space [or left space) on to itself carrying lines into lines is of the form 

(«i, ••- , 2 m ) = (tui,- - ， ^ 广 ^ 十 （ n,--- ， r m ), (8) 

where Q = is a non-singular matrix and a is an automorphism of the sfield. 
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The converse of the theorem is also true, since it is evident that (8) carries 
lines into lines. The theorem should have been known to geometers, since its proof 
depends on the classical method of meet and join. However I could not find an exact 
reference for it. Nevertheless, the theorem is known at least for the real field (Veblen 
and Whitehead^). The proof is comparatively simpler there, since the identity is 
the only automorphism. For completeness a proof of Theorem 2 is given in the next 
section. 

3. Proof of Theorem 2 

1) m = 2. Suppose that a pair of point (oi, a2) and (6i, 62) is carried into (cj, C2) 
and (di, d2) respectively. Then the line joining the previous pair. 

(1 — 1 )( 01 , 02 ) + X ( bi , l >2) 

is carried into the line joining the second pair 

(1 -I 2 )(Ci,C 2 ) + I <T (£ii,d2), ⑼ 

we shall now prove that 1 —♦ x* 7 is an automorphism of the sfield. Since there is a 
mapping of the form (8) carrying any pair of points, into (0,0) and (1,0) respectively, 
it is enough to prove our assertion for (01,03) = (ci.ca) = (0,0) and (61,62)= 
(di, da) = (1,0). That is, if the line («,0) is carried into (t^.O) and 1° = 1, 0° = 0, 
then 

(s + t)° =8° + t a , (at) a = 8°t a . 

These follow from the classical method of “meet and join” as illustrated respectively 
in the following figures: 





Since the sfield is not necessarily commutative, it might cause some difficulties 
for the second case, let us work it out in detail. 

Let l be the line (i,0) which is determined by the points (0,0) and (1,0). We 
use s simply to denote the point (s,0). fVom 0, we draw a line io : ((iZo，®)，® G 中 ) 
which is different from i, and from 1， we draw a line l\ : ((i?i + 6 $) which 

is different from l and is not parallel ① to io ^ h)- The intersection of the 


(J) By parallel, we mean that the two lines do not intersect. 
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lines l 0 and h is denoted by Pi{(lo~ ii) _ 1 ^o, (^o - ii) -1 )- Parallel to h and passing 
through s, we draw a line l s : (xh + s,x). The intersection of l, and is denoted by 
p 2 (s(l 0 - /i) - 1 io,«(^o - ii) -1 )- The line l t joining pi and t is denoted by 

((l- y)((lo - iir'Io.ao- ii) -1 ) + y(l,0)). 

Parallel to l t and passing through fn we draw a line l tt ： 

((5(^0 —:— +x(t — (io - — “) -1 ))- 

The intersection of l s t and the basic line l is given by 1 = si, that is, it is the point 
(st,0). This completes the proof of the statement. 

Let i 4 (®i,a ； 2 ) be the mapping under consideration. Since any three points not 
on a line can be carried into ( 0 , 0 ), ( 1 , 0 ) and ( 0 , 1 ), we may assume that 

A(0,0) = (0,0), A(1,0) = (1,0), ^(0,1) = (0,1). 

After an automorphism, we may assume that, by the result proved before, 

A{x, 0) = A(x, 0) 

holds for all x belonging to $ and, 

^(0. v) = (o> v°) 

for all y belonging to Then the point (x,y), being the intersection of a line passing 
through (a:, 0 ) and parallel to y-axis with a line passing through ( 0 ,y) and parallel to 
x-ax\B, is mapped into (a.j/^), i.e., 

A(x,y) = (x,y a )- ⑽ 

By (9) with (<Xi,a 2 ) = (ci,c 2 ) = 0, ( 61 , 62 ) = (i,y) and (di.da) = we have 

A{t{x,y)) = (t T (i, y* 7 )). (U) 

On the other, hand, substituting tx and ty for x and y in (10) we obtain 

A(tx,ty) = (tx, (ty) a ). (10i) 

Comparing (11) and (10i), we have 

t = <r = 1. 

Therefore 

A(x,y) = (x,y) 

for all x and y in The theorem is thus proved. 
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2) Let m > 2. The theorem can be proved by induction, and we shall leave to 
the reader to generalize it. 

Remark Without any essential difficulty, we can extend our theorem to the 
following form: let (m > 1) be the m-dimensional affine (left or right) space over 
the sfield If there is a one-to-one mapping carrying onto and carrying 
lines into lines, then we have m = m' and that either $ and are isomorphic or they 
are anti-isomorphic. In the first case, and 少 ' m are both left or right spaces; if we 
identify with $ and suppose that and are both left spaces, we obtain the 
transformation (2). For the second case, one of and is the left and the other 
the right space; suppose is the left space and 蚤二 is the right space the mapping 
can be written as 

- ,w m ) -» (Q(wi,- - ,w m ) T )' + (n, •• - ,r m )', 
where r denotes the anti-isomorphism. 

4. Maximal set of rank 1 

Definition A set of n x m matrices over is called maximal (of rank 1)，if 
any pair among them is coherent and if there is no other nxm matrix, outside of the 
set, which is coherent to each element of the set. 

Theorem 3 Every maximal set is equivalent under the group (1) to either 

f Xi X 2 ■■- : Tm 、 

0 0 … 0 

、 0 o … 0 ; 

or , \ 

yi 0 ••- 0 

J/2 0 ••- 0 

K Vn 0 ■ 0 y 

Proof We may suppose that the set contains two points 0 and 

f \ 0 ••- o'' 

0 0 … 0 

Zo= .. . 

〈 0 0 … 0 / 


( 12 ) 


(13) 


is evidently not a maximal set. If the set contains a point besides that of (16), say a 
point (14) with a 2 , • • ■ ,«m not all zero, then none of the other elements of the set can 
be of the form (15). This asserts that the maximal set containing 0 and Zo is either 
(12) or (13), since evidently every pair of elements of (12) or (13) is coherent. 

The above proof enables ns also to establish the following theorem: 

Theorem 4 Given any two coherent matrices, there are two and only two 
maximal sets containing both of them. 
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5. Intersections of maximal sets 


Now we consider the intersection of two maximal sets. Supposing that their 
intersection is not empty, we may take 0 to be one of their common point. A maximal 
set containing 0 is either of the form 


yi y-2 

o o 


Q, P = P (n) , Q = Q (m) , 


x\ 0 

%2 o 


k X„ 0 ••- 0 y 

In fact, let Z be a point of rank 1, then we have non-singular P and Q such that 


1 0 … 0 \ 
0 0 … 0 

P~ l ZQ~ x = 

\ 0 0 … 0 / 


This establishes the assertion by the result of §4. Therefore we may assume that the 
two maximal sets are either 
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The first and the third cases can be treated similarly. 

Now we consider the case (i). Without loss of generality we may assume that 
Q = I. Let 

P = (Pr»)l<r,»<n- 

The intersection is given by those y satisfying 

(P21 ) ••- ， Pnl)’(yi， … ,ym) =0. 


If (p2i, - • - ,Pni) is a zero vector, both maximal sets are identical. If (P21， . • • >Pni) is 
not a zero vector, then (yi ， •.. ， y m ) is a zero vector. That is, the maximal sets have 
only one point in common. 

For the second case, the transformation Z\ = P~ l Z carries both maximal sets 
of (ii) into those of (ii) with / 5 = / and Q = I. Therefore the intersection of two 
maximal sets is given by 



Therefore the intersection of two distinct maximal sets is either empty, a single 
point or a set equivalent to (17) under tho group (1). 

Definition The intersection of two distinct maximal sets, which contains more 
than one point in common, is called a line. 

Theorem 5 Let X and Y be a pair of coherent matrices. There is one and 
only one line passing through them. 
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By Theorem 4, there aro two (and only two) maximal sets containing both; they are 
(12) and (13). The intersection is given by 

f t 0 ■■ 0 \ 

0 0 … 0 _ 


Therfore we have the theorem. 

The general equation of a line can be put into the form 
p'tq + W, 


where p and q are two vectors of dimension n and m respectively. Furthermore, the 
general equation of a line in the maximal set ( 12 ) is of the form 



and in the maximal set (13) is of the form 


f ai \ 



ai 


h 

. 0 

t + 

0 

、 On ) 


\ bn ) 


(19) 


6 . Linear subspaces 


Let us now consider the geometry in a maximal set. We call, sometimes, a line a 
linear subspace of dimension 1. Let / be a line of the maximal set. Let p be a point of 
the maximal set outside of l. Draw all lines in the maximal set passing through p and 
intersecting l. The set of points on all these lines is defined to be a linear subspace of 
dimension 2. Let q be a linear subspace of dimension 2, and let p be a point of the 
maximal space but not in q. We construct the lines passing through p and intersecting 
q. The set of all the points on all these lines is called a linear subspace of dimension 
3, etc. Evidently we have 

Theorem 6 Every linear subspace of dimension d can be constructed from 
any line of it by the previous process and the number of steps is exactly d—\. d, is 
invariant under any mapping leaving the coherence invaHant. 

Therefore, we have 

Theorem 7 The maximal set (12) is an affine space of dimension m and (13) 
is of dimension n. 
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Evidently, for m — n, (12) is equivalent to (13) under a transformation of (2) 
which carries right lines into left lines and it can occur only when the field has an 
anti-automorphism. Moreover, if m^n, any mapping leaving the coherence invariant 
cannot carry (12) into (13). 


7. Arithmetic distance 


Definition Two matrices Xo and X r are said to be of arithmetic distance r, 
if there exist r — 1 matrices 

乂1、 … ，X r -i 

such that Xi and Xi_i(l < t ^ r) are coherent, and r is the least integer h&ving this 
property. 

Theorem 8 Two matrices X and Y are of arithmetic distance r, if and only 
if X — Y is of rank r. 

Proof If X - y is of rank r, we may assume that X = 0 and 

-rr:)’ 


where i ■⑺ denotes the r-rowed identily. Then 


I e 0 
0 0, 


are intermediate points, therefore the arithmetic distance between X and V is < r. 
Further, from the well-known property that 


rank(P + Q) < rankP + rankQ, 

we deduce that, if X and Y are of arithmetic distance r, the rank of X - K is < r. 
The theorem is thus proved. 

Theorem 9 Any r maximal sets of rank 1, which have a unique point in 
common and such that no two of them, have any other common point can be carried 


by (1) simultaneously into 


'x n 
0 

0 



121 

0 


0 

®2m 



0 、 

( 20 ) 

2-rm 

0 


or into 






leaves the first maximal 



which spans with 0 a maximal set of rank 1: 



( 22 ) 


(23) 


(23) cannot happen, since the intersection of the first and the second maximal 
sets contains more than one point. 

Proceeding in this way, we have Theorem 9. 


8. Proof of Theorem 1 




we use also Xi and Yj to denote respectively the matrices 
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0、 

Xi ， （()，••■ ,0 ，巧 ,0,… ， 0). 

0 

: 

Let A(X) be a mapping considered in Theorem 1. By Theorem 9, we may assume 

that 

A(Xi) = X；, X ⑼ = 0. 

By the result of §6, A induces a mapping of Xi onto carrying lines into lines. 
Therefore, by Theorem 2, we have 

x ： =x^Q u (24) 

where Qi = (jf™) is non-singular and 内 is an automorphism of Without loss of 
generality we can assume 

^1 = 1, Qi = / (m) - (25) 

The maximal set of rank 1, other than X\ contsuning both 0 and (0,0, … 

0, -- ,0) is Yj. Thus 

A(Yj) = Y；. 

By Theroem 2, we have 

y*i=PjV Ti , (26) 

where Pj = Pj n ) is non-singular and Tj is an automorphism of 
Let XijEij be the intersection of and Yj, then we have 

A(xijEij) = x^Eij. 

Consequently, Qi and Pj are diagonal matrix. Write 

Qi = [Qil, • ' ■ > Qim], Pj = \Plj, • •' . Pnj)- 



Comparing (24) and (26), we have 

x 1jQH =Pij x ?j> ( 27 ) 

which holds for all Xij of In particular, for Xij = 1, we have qij = Pij. Therefore, 
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If we use Pf 1 A instead of A we may assume without loss of generality, that A satisfies 
(24), (25), (26) and 

Pi =/ (n >. 


Then from (27) for ) = 1 we deduce that 如 =1 and Ui = 1 for * = 1, ••- ,m. 
Further, the linear dependence of ii and Xi implies that of xi = XiQi and XiQi. 
Thus Qi = pj. Since Qi = (1 ， 拟， … ， 9 «n)> Qi = I for all i. Similarly, Pj = 1 for all 
j. Thus we have 

A{Xi) = Xi, A(Y j ) = Y i . (28) 


Now we consider 


Suppose that the rank of 



is n, so is 

(-1 

\ X n ) 




,by Theorem 7. Since 


(X\ — A2I2 —…- AnZn 、 


Xi 入 2®2 入 n®n 

0 


0 

0 J 


^ 0 > 


! — *i — A2X2 — • •. _ A„i n 

x 2 


\ 

is less than n for any A. Therefore 


*1 + 

J=2 

Xh = > 2 ^ h ^ n. 

J=2 


Applying the same method to the t-th row instead of the first row, we have 

= +X^ u S );r j' 

>-=1 
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Eliminating x \ , we have 

㈡ j = 2 

Because of the independence of a:J, … ， a;*, we have 
Hj = 0 , 2 < j < n. 

Thus 

*i = *!• 

Similarly, we have zj = xj, that, is, if X is of rank n, we have 
AX = X. 

As to the singular case, the theorem can be proved as follows: let 




If X\\ ^ 0, we have, by (27), that 


(29) 



( Xu 

X\2 

*13 ■■ 

__ Xin 

*ln+l - ' 

.. 、 


f X\l 

112 

Il3 … 、 


0 

入 2 

0 •- 

. 0 

0 

0 


0 

A 2 

0 … 

A 

1() 

0 

入 3 . _ 

0 

0 

• 0 

= 

0 

0 

入 3 … 


lo 

0 

0 •- 

An 

0 

‘ - 0 1 


V 0 

0 



Xi! - Ill xj 2 - Xu a：；3 - *13 


X 21 


*22 — ^2 x 23 

x 32 x 33 ~ -^3 


/ 


is of rai»k< n for any A 2 A 3 .. •一 0. Therefore the only possible case is that 


*11 - *n = 0. 

If x*! ^ 0, we proceed in the reverse way. Thus we have ijj = :ii. Similarly, we 
have 
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x ij = x ij- 

The theorem is thus proved completely. 

Remark Without any essential difficulty, we can extend our theorem to the 
following form: let 屯 n , m (i < n < m) be the totality of all the nxm matrices over the 
sfield If there is a one to one mapping carrying onto and carrying the 

coherence relation of the one into the other, then we have either n = n',m = m , or 
n = m', m = n': For the first case, the sfields 办 and are isomorphic; if we identify 
them, we obtain the transformation (1). For the second case, the sfield $ and are 
anti-isomorphisc, the mapping can be written as 
(PZ T Q + R)', 

where r denotes the anti-isomorphism. 

Part II. Geometry of Rectangular Matrices 

9. Projective geometry of rectangular matrices 

We introduce the homogeneous coordinates of a rectangular matrix Z (= Z( n ， m )). 
We write 

Z = {X,Y) t 

where Y = Y^ n \ X = The pair of matrices 

(X,Y) 

is called a homogeneous coordinate of the matrix Z. A necessary and sufficient con¬ 
dition that two pairs of matrices 

(X,Y), (XuYO 

represent the same matrix Z is that we have a non-singulax Q (= Q( n )) such that 

Q(x 1 y) = (x 1) y 1 ). (30) 

Now we extend this notion to the pairs with singular Y. 

We start with an n x (n + m)-matrix W of rank n. Two matrices W and Wi 
are said to be equivalent, if there is a matrix Q (= Q( n )) (automatically non-singular) 
such that 

W 9 = QW. (31) 

This equivalence relation classifies n x (n + m) matrices into classes. We identify 
the matrices belonging to a class as a point. The totality of these points form the 
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projective space of rectangular matrices. In case n = 1, this reduces to the ordinary 
projective geometry. 

The space admits the transformation 

W m = QWP, (32) 

where P = p( m+n \ Q = Q(") are non-singular. The space is evidently transitive. 
Two different points Wi and W 2 are said to be coherent, if the rank of 



is n + 1. This is the smallest possible value for distinct Wi and Wa. In fact, if (33) 
is of rank n, since Wi is of rank n, each row of W 2 depends on those of Wi, and 
consequently. 

W 2 = QW 1 . 

That is, Wi and W 2 represent the same point. 

Evidently, (32) leaves the coherence invariant, since 

(^)^( Ql °)( WI )P. 

\ ^ ) V 0 ^2 J 

Let us break W up into two parts 

w = {x,Y), x = x (n - ro) , r = y (n) . 

Those with non-singular V are called finite points and those with singular Y are called 
points at infinity. If W = (X*, Y*) and IV = (X, Y) are both finite points, we have 
the birational transformation 

Z* = = (ZB + D)~ l {ZA + C), (34) 


A B 
C D 


A = i4 (TO) , B = C = C (n,m) , D = P (n) . 


Two finite points Z and Zi are coherent, if and only if the rank of Z — Zi is one . In 
fact, we have now 




Theorem 10 Any coherent pair of points can be carried, simultaneously into 

( O , 約 


and 

where N is a matrix with 1 at (1 ， 1)-position and zero elsewhere. 

Proof Since the space is transitive, we may assume that one of the points is 

(O,/<**>). 

Break the coordinates of the other point into 

(x (n.m) ， y (n)) (35) 

Then X is of rank 1. We have matrices Q (= <?( n )) and P(= P^) such that 
QXP = N. 


Then, we have 

Q(X,Y) ^ P o ^ ^=(N,QY) 

and 

(0， J )( 二 ^)=( 0 , 1 ). 

Therefore we may assume that X = N in (35). 

If V is non-singular, then 

) =(W，/) 

and 

y 0 ,) = (o，/). 

The theorem is proved. 
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where j/ 2 . • - - ， y n independent vectors. There exists a vector z such that 
(z + A 2 J/2 +- l-A n y„ \ 


if of rank n. Let 


( 0 , 1 ) 


: H = 


( 0 , 1 ) 


(KY)I 


-(N,NC + V), 


where NC + y is non-singular. The theorem reduces to the cases considered previ¬ 
ously. 


10. Projective geometry of square matrices 

In case m = n, there is an extra type of transformations which corresponds to 
the non-homogeneous transformation of the form 

Z* = (Z T 'B + D)~ l {Z T, A + C), (36) 


where r is an anti-automorphism of the sfield. Among such transformations the most 
essential one is 

Z* = Z r， . (37) 

If homogeneous coordinates are again introduced, we have 
Y'~ x X* = X T 'Y r '- x , 


that is 

X*Y T, = Y'X T, , 


we write this relation as 

(x*,y*)5(x,r) T, = o, 


(38) 
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where 



(39) 


Now we are going to prove that (38) defines a one-to-one mapping carrying the 
projective space of matrices onto itself. Let (Xi.Vi) be another point which satisfies 


= 0 - 


Then, we have 

By Sylvester's law of nullity, we deduce that 

x* y* \ 

Xi Y,) 

is of rank < n. Since (X*,y*) is of rank n, we have a non-singular Q such that 

(x*,y) = Q(x 1 ,y 1 ). 

More generally, let 5R be any 2n-rowed non-singular matrix. Then 

(X*, Y*)n(X, Y) T ' = 0 (40) 

defines a mapping from (,X,Y) into (X*,K*). 

The product of two such relations is a transformation of the form (1). In fact, 
let cr = tti which is an automorphism of the field, then from 

(XM^X^r = 0 


and 

(A, 聊 2(U) Tl, = o, 


we deduce 


( (x,y) <r 5tp , 

V (X 2l l2)fR 2 


)(f ，= 0. 


By Sylvester’s law of nullity, we have 


(X 2 ,r 2 )SR 2 = Q(X,5TK ?'， 


i.e., 

(x 2 ,r 2 ) = Q{x, ( 4 i) 
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which is a transformation of the form (1). 

This establishes also the following. 

Theorem 11 The enlarged group generated by all the transformations of (1) 
and all the relations (40) is simply generated by (1) and one of relations of (40), say 
for example (38). 

Now we are going to state the fact that the coherence also remains unaltered 
under the enlarged group. 

Theorem 12 The mapping (40) keeps the arithmetic distance invariant. More 
precisely, let 

(X* i y*)<R(X,F) T, = 0 

and 

= 0. 


The two matrices 

have the same rank. 

Proof We have a transformation (1) carrying (m and (Xj,yi) into (O,/) 
and (I» r , I), where 

O \ 
o o )' 




let the transformation be 


Let 

then, we have 


and 


Consequently 


(X,Y) = Q(U,V)T. 

= {U\V), 
W)(CU) T, = 0 
(U ； ,V ： )(L r ,iy = 0 . 
V =0, U^L r + V ； = 0. 


The rank of 


f IT V* 入一 （IT O \ 
V ^ = V U i - U i L r ) 



is equal to the rank of U{L r , i.e. $ n + r. Therefore we have 


u 

y\ 




^ ) 



V 


, (V 

V 

==rank of 1 




\ut 



By symmetry, we have the theorem. 

FYom (41), we deduce easily that (40) is involutory, if and only if <r = 1 and 
讲 = e ^ r， where q belongs to the center of Consequently 91 = 
that is qq t = 1. Such an involutorial correspondence (40) is called a duality. The 
fixed elements (if exist) of a duality, i.e. those points {X, Y) satisfying 


(X, YMX, Yf = 0 

a complex. The coherence o 
)lex is equivalent to the sta 


(Xi, Yi) both on the complex is equivalent to the statement that the rank of the 
matrix 

(X, 

is one. In fact, the rank of 


(x, 

0 


is equal to 1. By Sylvester’s law of nullity, we have 


is of rank < n + 1. The assertion follows. 

In case the complex is not an empty set and ^ is of characteristic ^ 2,5K can 
be transformed into a particularly convenient normal form. We have two n-rowed 
matrices S and T such that 

is non-singular. Then 






•296- 


华罗庚文集代数卷 n 


where B is non-singular and H T, = qH. Since 


0 I )\ q t B t, H )\ O I )" 


we have a matrix ff) such that 


Corresponding to this normal form, we have the transformation, in non-homogeneous 
coordinates 

z* = - e z T, . 

It is an interesting problem to characterize the geometry of matrices on a complex 
by means of the property of coherence. In a previous occation, the author established 
a particular case with $ commutative and r = 1 and e = —1. 

Remark In the non-homogeneous coordinates the transformation takes the 

form 

Z m = (ZB + D)~\ZA + C) 

and 

Z* = + D)~ l (Z r 'A + C). 

Apparently, it seems to be difficult to construct the product of two transformations. 
This difficulty can be avoided by means of the following identity: for any Z (= Z^ n >), 

{ZB + Dy^ZA + C) = {SZ- R){-QZ + P)~\ 


( A 


( P 

Q). 

.( 1 


\C 

D) 

\ R 

s)~ 

Vo 

n 


11. Maximal set 


We define similarly that 

A maximal set (of rank 1) is a set of points in the projective space of rectangular 
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Similar to Theorem 3, we have 

Theorem 13 Under the group (32), every maximal set {of rank 1) is equivalent 
either to the set constituted by the finite points 


and the infinite points 


1 0 ••• o\ 

0 0 1 … 0 

0 0 0 … 1 j 

x m 0 0 … 0 > 

0 0 1 0 

0 0 0 … 1 , 


or to the set constituted by the finite points 

^ j/i 0 • • • 0 1 0 

y2 0 ••- 001 

yVn o ••- 000 

and the infinite points 

^ j/x 0 … 0 0 0 

1/2 0 • • • 0 0 1 

\ y„ 0 • • • 0 0 0 


0 

0 



0 

0 


(42) 


(43) 


(44) 


(45) 


The proof is similar to that of Theorem 3 with some slight modifications. 

Analogously, we define “lines” as the intersections of two distinct maximal sets 
which contsun more than one point in common, and the arithmetic distance between 
two points as we did in §6. By these concepts, we construct linear subspaces (cf. §5). 
The linear subspaces of dimension m—n axe of primary importance. It can be proved 
without difficulty that every subspace of dimension m — n can be carried into the 
following normal form: the set S is formed by the points 
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and 


f *n+i … x m 1 0 

0 … 0 0 1 

< 0 0 0 0 


( yn+l … Vm 0 
0 0 0 

. 0 • ■. 0 0 



0 … 


where at least one of the y's is not zero. 


0 0 
0 0 

1 0 

0 0 
0 0 

1 0 


0 、 
0 

0 


0 

0 

0 


12. Fundamental theorem in the projective geometry of rectangular 
matrices 


Theorem 14 Any one-to-one mapping carrying the projective space of rect¬ 
angular matrices onto itself is of the form (37). In case m = n, we have an extra type 
as described in §8. 

Proof Since the linear subspaces of dimension m — n are transitive, we may 
assume, without loss of generality that the mapping A under consideration leaves the 
manifold S invariant. 

It is easy to verify that a point P is a point at infinity, if and only if, there is a 
point on S, of which the arithmetic distance from P ia <n. 

Thus the mapping A carries finite points into finite points and infinite points into 

infinite points. By Theorem 1, we may assume that all the finite points are left fixed 

elementwisely by A. Since any m - n dimensional linear subspace may be legarded 
as a manifold S, applying the same argument as above to new S’s and comparing the 
common finite points arising from different manifolds 5 we know that all points, both 
finite and infinite are left fixed elementwisely by A. The theorem is thus proved. 


Part III. Applications to Algebra 

13. Module 

Now we consider the set of n x m matrices as a module, for it is closed with 
respect to addition. A one-to-one mapping (A —* A°) of a module onto itself is called 
a module automorphism, if it keeps the additive relation invariant, that is, 


(A + B)° = A° + B°. 


(46) 








Theorem 15 Every module automorphism of the set ofnxm matrices carrying 
matrices of rank one into matrices of rank one is of the form 


where P (= /*( n )) and Q (= Q^) are non-singular and a is an automorphism of the 
sfield When m = n, in addition to (47), we have also 


where t is an anti-automorphism of the sfield. 

In fact, let r{Z) = Z\ be the mapping under consideration. FVom (46). we 
deduce immediately 尸 ⑼ = 0, r(—A) = -T(A). If P—Q is of rank 1， then r(P—Q) = 
r(P)~r(Q) is also of rank 1. Thus the theorem follows immediately from Theorem 1. 

Now we restrict ourselves to the case m = n. The set of all n-rowed matrices 
form a ring which is known as a total matrix ring. A semi-automorphism (^4 —* A a ) 
of a ring is a one to one mapping of the ring onto itself satisfying 

(A + BY = A a + B a , (49) 

(ABAy = A°B a A ,r (50) 


Evidently, automorphisms and anti-automorphisms are semi-automorphisms. 
From (50) and (49), we dequce 

(XY + YXY = [(X + Y) 2 -X 2 - Y 2 } a 

=[(X + Y) a ] 2 - {X°) 2 - {Y a ) 2 = X a Y° + Y°X°. (52) 


It is easy to deduce (50) and (51) from (52) and (49), provided that the characteristic 
of the sfield is different from two. In fact, it follows from the identity 

2(XYX) = {XY + YX)X + X(XY + YX) - (X 2 Y + YX 2 ). 

Theorem 16 Every semi-automorphism of the total matrix ring over a sfield 
is either of the form 

Zi=PZ T P-\ (53) 

where t is an automorphism of the sfield, or of the form 
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Zi = PZ^p- 1 , (54) 

where r is an anti-automorphism of the sfield. 

Proof 1) By Theorem 15, it is enough to prove that a semi-automorphism 
carries matrices of rank 1 into matrices of rank 1. Let A —♦ A a be the mapping under 
consideration. 

2) An element A satisfying ^4 2 = j 4 is called an idempotent matrix. FVom (50) 
with B = 1, we have (A CT ) 2 = A°. That is, a semi-automorphism carries idempotent 
elements into idempotent elements. 

3) Let £ be an element orthogonal to an idempotent element A, that is, AB = 
BA = O. Then, by (52), we have 

0 = (AB + BA)° = A a B° -B a A a 

and by (50), 

0 = {ABA)° = A°B a A° = ~{A a ) 2 B° = -A a B°. 

Therefore 

A a B a = B a A a = 0. 

4) An idempotent element A is called irreducible, if j4 = B+C, B 2 = B,C 2 = C, 
BC = CB = O implies either B = O or C = O. This property is evidently invariant 
under a semi-automorphism. 

5) The theorem follows from Theorem 1 and the following 

Lemma A matrix M (j^O) is of rank 1 if and only if there exists an irreducible 
idempotent matrix E such that 

(I- E)M{I -E) = 0 

and 

(Af - EME) 2 = 0. 

Notice that both relations are invariant under a semi-automorphism. 

6) Proof of the lemma. If M is of rank 1, there is a matrix P such that 



Prom the second equation 



it follows that either b = o or c = o. Therefore M is of rank 1. 

Remark It is known【 9 l that a simple ring with descending chain condition for 
left idelas is simply a total matrix ring over a sfield. Therefore Theorem 16 can be 
“dignified” as follows: 

Theorem 17 A semi-automorphism of a simple ring with descending chain 
condition for left ideals is either an automorphism or an anti-automorphism. 

Theorem 18 An automorphism (or anti-automorphism) of a simple ring with 
descending chain condition for left ideals is a product of an inner automorphism and 
an automorphism (or an anti-automorphism) induced by an automorphism (or an 
anti-automorphism) of the sfield. 

The method of the proof can be easily extended to establish a corresponding 
theorem for semi-simple rings. 

Let R be a semi-simple ring with descending chain condition for left ideals. It is 
known that it is isomorphic to the ring formed by all the matrices of the form 
x (m l)+x (m 2 ) + ... + x (m r ) ) 

where xj mt) runs through all matrices with elements in a sfield $i. 

The steps 1) 〜 8) are the same except that the E of 5) may be one of that form 
in a certain block. Let E belong to the i-th component, and let its image belong to 
the j-th component. The same method 7) asserts that the mapping carries all the 
elements of rank 1 in the i-th component into all the elements of rank 1 in the j-th 
component. Therefore our mapping carries the i-th component into the j-th com¬ 
ponent. The i-th component is matrices in and contains mutually orthogonal 
irreducible idempotent elements, and we deduce then that rru = rrij and 屯 and 
are isomorphic or anti-isomorphic. If we arrange the decomposition into a proper or¬ 
der, the semi-automorphism induces a semi-automorphism on each component. This 
is a generalization of a result of Kaplansky^. 
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Certainly, we can apply the previous method to the problem of isomorphism of 
different Jordan rings. 

14. Jordan and Lie automorphisms 

A one-to-one mapping of a ring onto itself satisfying (49) and (52) is called a 
Jordan automorphism of the ring. Since (49) and (52) imply (50) and (51), in case 
that the basis sfieid is of characteristic different from 2, we can restate Theorem 15 
as 

Theorem 19 A Jordan automorphism of a simple ring with descending chain 
condition for left ideals is either an automorphism or an anti- automorphism, provided 
that the characteristic of the afield is different from 2. 

An analogous, but much more difficult, problem is about the Lie automorphism. 

A Lie automorphism (^4 —♦ A°) is a one-to-one mapping of a ring onto itself 
satisfying the property that 


(A + B)° 

= A° + B' T 


and 

{AB- BA)° = 

: A a B° - B a A°. 

(55) 

For brevity, we introduce the notation 

[i4, B]= 

AB- BA. 

(56) 

Therefore (55) can be written as 

\A,B] a = 




The module generated by the elements of the form [A,B] is called the derived 
Lie module of the ring, or simply the derived module. We use L to denote the derived 
Lie module of the sfieid 

Now we consider again the total matrix ring R over a sfieid The elements 
of the form XI is isomorphic to the ground sfieid, we identify these elements with 
those of the ground sfieid. The results of Lie automorphism is not so perfact as that 
of Jordan automorphism; we have to impose several conditions in order to obtain a 
similar conclusion. Before we are going to prove the theorem, we need quite a number 
of lemmas. 

15. Properties of the derived Lie module 

Theorem 20 A necessary and sufficient condition for a matrix belonging to 
the derived Lie module of R is that its trace belongs to the derided Lie module L of 电 . 
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+ A 2 ,B] = [A 1 ,B] + [A 2 ,B], 

t A = aEij, the trace of [A,B] belongs to the derived 
Lie module L of where Eij is the matrix with zero element everywhere except 1 at 
the (*» j)-position. Now we have 

tr[o^y, B\ = ti{aEijB — BEija) = abji — bjia, 
which evidently belongs to L. 

2) we can readily verify that 

[aEij,Ejj] = aEij, for i ^ j 

and 

= aE n - aEu, for i^l. 

Any matrix with trace belonging to L is a sum of the previous elements and 
bE u , 

where b belongs to L. Since 

b = y^(a<6i - bidi), 

we deduce that 

bEn = — biEuatEn), 

the theorem is now proved. 

Consequently, the property that the trace belongs to the derived module of K is 
invariant under similarity transformation, since 

P[5, T]P~ l = \PSP~\PTP~ l ). 

Incidentally, we obtain the following 

Theorem 21 If S and T are similar to each other, then the trace of S is 
congruent to that of T, mod L, where L is the derived Lie module of 屯 . 


It is sufficient to prove successively that the theorem is true for n = 2 and for 


： )'(o ：： 


since the general linear group is generated by these elements. Since 
AaA -1 -a = (Aa)A -1 - A _1 (Aa) 
belongs to L, our theorem follows immediately. 
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16. Lemmas 

Now we assume that the characteristic of the sfield diffes from 2 and 3. 

A matrix A is calld an J-matrix, if it can be expressed as 

A = al + E, (57) 

where a belongs to the center of $ and E is an idempotent matrix, i.e., E 2 = E and 
E^O. 

Theorem 22 A matrix A is an I-matrix, if and only if 

= (58) 

for all B. 

Proof We have 

[A,[AK-B]]] = A 3 B- 3A 2 BA + 3ABA 2 - BA 3 . 

Since [^4, B] = [A-al, B], evidently (57) implies (58). 

Now we suppose that 

A 3 B- 3A 2 BA + iABA 2 - BA 3 = AB - BA (59) 

for all B. We write 

A = (Oij), A 1 = (o^), B = (bij). 

Then, for all B, we have 


^2( a if b ik - -3 ^ (a^bjiaik - Oijbjiaj^)= 乞 -6 ㈣ *>)■ 

i=i j,i=i i-i 


Putting B = xEji, we have 



a^fxaik = Oijxa\l\ for j ^ i, 


(60) 

a!：fx - OijX- 3a^xakk + 3^10^ = 0, 

for j ^i,k = l, 

(61) 

— xaik + 3a\^xai k - 3anxal^ = 0, 

for j = i,k l 

(62) 

and 

a[fx - xa^ - 3^10^+ 3auxa^ = 

: aux — xakk 

(63) 

for j = i,k = l. (60)—(63) hold for all x. 




If 4 is a central element, i.e., A - cl, where c is a cnetral element of K, then 
A = (c-l)I + I and A is an /-matrix. Thus there remains to consider the case that 



4 is non-central. If A does not belong to the center, we have an invertib: 
° such that the element at (l,2)-position of PAP~ l is different from zero, 
f there is a non-diagonal element of A different from zero, we have a per 
natrix P such that PAP~ 1 satisfies our requirement. Next, if j 4 is a diagone 
he statement follows from the following formula 


(1 « 

U« o\ 

( 1 

(a tb - at \ 

U i 

Jlo k ) 

U ij 

Vo b ) 


since the relation (58) and the property of a matrix being an /-matrix 
nvariant under similarity transformations, without loss of generality we ca 
hat a \2 7^ 0. 


Prom (60), we have 










I 


for fc ^ i when aij T 0 and aik T 0. lne element g is mdepenaent oi i ana k. ii 
^ o but ajfc = 0, then a 泛 ）= 0, （ 64) is also true. Finally, if = 0, then = 0; 
in this case (64) also holds for p = 0. Therefore (64) holds for all fc ^ i. From (61) 
with a: = 1, we have 

a i? - a i2 = 3ai?an - 3ai2C^ = 3a^akk - 3ai20^ , 
then, since the characteristic of the sfield is / 3, 

a i2 — a **) = a ia( a i? — )• 

Prom (64), we deduce 

- a.^1 — ei^kk - an)- ( 肪 ) 

FVom (64) and (65), we have 

A 2 - a^I = g(A - a n I). 
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That is 

A 2 = gA + XI, 


( 66 ) 


where q belongs to the center. We deduce immediately that XI is commutative with 
A. Repeating (66)，we have 

A 3 = gA 2 + \A= (e 2 + A)A + q\I. (67) 

Substituting (66) and (67) into (59), we have 

[(e 2 + X)A + q\I\B- 3(M + \I)BA + 3AB(gA + XI) 

-3B[(e 2 + X)A + e XT\ = AB - BA 
for all B, and then 

(p 2 + 4A- 1)(AB - BA) = 0 


for all matrix B which are commutative with A. Since A does not belong to the 
center, we can choose a B which is commutative with A but not with A. In fact, we 
can choose B as a matrix with elements in the prime field and AB ^ BA. Therefore 
we have 

g 2 + 4 入 -1 = 0. 


Then, from (66), 


that is, i4 — — /is an idempotent element. 

Remark The theorem is not true for fields of characteristic 3. In fact A 3 = 
A + tI where r belongs to the center, is a solution, but A is not an /-matrix, 

7 0 1 0 \ 

0 0 1 

， 1 oj 

is such a matrix. The theorem does not hold for fields of characteristic 2, since 

O 

is a “Gegenbeispiel”. 

A matrix A is called an N-matrix, if it can be expressed as 
A = aI + N, 

where a belongs to the center of $ and N 2 = 0. 


(68) 
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Theorem 23 A matrix A is an N-matrix, if and only if 

[A,[AI^B]]] = 0 (69) 

for all B. 

Proof Instead of (59), we now have 

A 3 B- 3A 2 BA + SAB A 2 - BA 3 = 0. (70) 

Proceeding as in the proof of Theorem 22, we prove that 
A 2 = qA + XI, 

where + 4A = 0. Consequently 

17. Continuation 

Theorem 24 Suppose that A 2 = A, AC = CA, AD = DA and A ^ Q and 
^ I. Then 

[D 1 [A,B]] + [C[A,[C 1 |AB]]]] = 0 (71) 

for dl B if and only if either 

C = XI+ F, D = nl + F 2 , (72) 

where X and /x belong to the center and AF = FA = 0, or 

C = - G， D = /iI- G 2 , (73) 

where A and \i belong to the center, AG = GA = G. 

Proof We expand (71) as 

D{AB- BA)- (AB- BA)D + AC 2 B + C 2 BA + ABC 2 + BAC 2 
-2{AC 2 BA + ABAC 2 + ACBC + CBAC) + AACBAC = 0, 
which can be written as 

(C 2 + D)AB + BA{C 2 +D) + {C 2 -D- 2C 2 A)BA 
+AB{C 2 -D-2C 2 A) + 2C{2ABA - AB - BA)C = 0. (74) 

In order to verify that (72) implies (71), we need only to prove that C = F and 
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where / \ / \ 

D =( n 11 n 2 )'^- 

\ ^21 ^22 / V ^ 21 ^ 22 / 

Then, for all B21. we have 

(i) (C12 — Di 2 )Bii = 2C12B21C11] 

(ii) C12B21C12 = 0 ; 

(iii) Bii(Ch + Du) + (C ;2 _ ^22)^21 = 2C22B21C11 

and 

(iv) B21 (C12 + D12) = 2C22B-ilCi2- 

FY^om (ii), we deduce that C12 = 0 ① and from (i) and (iv) we deduce D i2 

C{ 2 = 0. 

Putting 

B 



ABJorCJJ 

1121 /- \ 

u Dcnell 
+1222 / . 
\ —/ Dp 、—/ 

s 气！ 

1 1 /-\ o B 

QQ-(. 
/—\ \—/ \— / 
Kcr22c 2a 
00 11 
,= crcjcncll 
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we deduce similarly D21 — Cf 2 = C21 = 0 and 

(v) (Cj 2 + Dll)Bi2 + Bi2(C22 — D22) = 2CnBi2C22- 

Since C22 = Cfj, we write 

C 22 ~ ^22 — ^11 + ^11 = Wij)， 

C22 = (7«)» Cn = (rfij) 
and B21 = (6y), then, from (iii), we have 

r n—T n—r r 

^2 bij0jk + Y1 a V b Jk = 2 lisbatStk 

j-1 j=l «=1 t=l 

for all bij. Consequently, we have 

(vi) 7ia<5tfc = 0 for a; 

(vii) 2 -yubStk = b^tk for t ^ fc; 

(viii) 27 ia Wfcfc = a ia b for i ^ s; 

(ix) 2 -yijbSkk = ocnb + bfikk 
for all 6. 

If there is a non-diagonal element of C22 different from zero, say 7“ 一 0 and 
i ^ s, then, from (vi) we have 6 t k = 0 for all t 5^ k. Then /?tfc = 0 for all t # fc, by 
(vii). Further, from (viii), we deduce that 

2 ^ Iq: « * 6 

belongs to the center and 8 kk — That is, Cn =* 51. 

FVom (ix), we have 仇* = /3 for all fc and 

2-ynSb = aub + b(3 

for all b. In particular 6=1, then cm = 2 -yu 6 - 0 . Substituting into the previous 
equation, we have b 0 - 0 b = 0 for all b. Therefore /3 belongs to the center. Then 

(x) CTJi+Di^/3/. 

From (iii) and (ix), we have 

C| 2 - I> 22 = (aij) = 2( 7y )<5 -pl = 2 C 2 i S- 0 I. 

That is 

D22 = 2 Cg 6 + 0 I = {C 2 2 -S 1 ) 2 -{S 2 ~ 0 )I. 

Therefore 

C 
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and 

D= 卜 ° )^(的 h O \ 

\ o D n ) \ O (C 22 ~ 5I) 2 - (S 2 - 0)1 ) 

”/+ ( o (^.^)2 )• 

This proves the theorem. 

By a slightly different method, we establish our theorem for the case Cu is not 
diagonal but notice that now we obtain the second conclusion instead of (8). 

Suppose that Cu and C 22 are both diagonal, then, from (vii) and (viii), both 
Du and D 22 are diagonal, and 
(xi) 2yibS k = otib + b0 k 

for all 6, we write ^ for etc. If there is one of the differences 、一 different from 
zero, then we have 

2(7* - = (a< - aj)b 



for all b. It follows that 

0k = P = 27<5 - 

is a central element for all fc. Therefor we have 

C?i + D u = 01, Cn = 61. 


Then 


and 


[ C o ： H S o LY-^{°o c：-,) 



o (0-S 2 )I o 

D22 y \ o C22 ~ 26C22 + 01 


(0-S) 2 I + 


o 

o 
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The theorem follows. 

In case there is one of the difference Si — Sj different from zero, then we have 
7* =7. a* = «. 

which belong to the center, and 

2-ySk = a + /3fc. 


Then 

C22 - Da = al, C22 = ll 

and 

C11 + Du — —al + 


Then 




(C u 
\ O 

O 

7 / 

y 

-(VI) 

and 

D = 

( D n 

V 0 

O 

D 22 

h 

(T 2 - 


That is 


C = 

=XI 

-G, 

D = nl- G 2 , 

where 


G 

< 

-(Cn-ll) 0 \ 

0 O ) 

satisfies evidently GA = 

AG = 

G. 




Theorem 25 Suppose that A 2 = A 0,1), C 2 = C, AC = CA. Then 

[C,[>i,B]] + [C7[>1[C7, [A 丑 ] j]j = 0 (75) 

for all B, if and only if 

AC^=0 or (/ - A)(I - C) = 0. 

Proof The theorem is an easy consequence of Theorem 24, by putting D = 
C 2 = C. FVom (72), we have 

C 2 = (XI + F) 2 = nI + F 2 , 

then A 2 = fi,2\ = 0. Consequently C = F and AC = CA = 0. 

From (73) and C 2 = D = C,we have 
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C 2 = (A/ - G) 2 = - G 2 = A/ - G, 

then A = 1 and 

(/- A)(I -C) = (I- A)G = G - G = 0. 

Remark Theorem 25 can be proved very simply by the direct expansion of 
(75). In fact (75) is equivalent to 

CA(CAB - CBA- ABC + BAC) 

- C{CAB- CBA- ABC + BAC)A 
- A(CAB - CBA- ABC + BAC)C 
+ {CAB- CBA- ABC + BAC)AC 
=-(CAB- CBA- ABC + BAC). (76) 

Evidently CA = 0 or (J - C)(I - A) = O implies (76). Conversely multiplying (76) 
by AC on the left, we have 

ACB(I -A)(I-C)=0 

for all R This implies either AC or (/ - A){I ~C) = 0. Therefore we have Theorem 
25. 

18. A theorem on Lie automorphism 

The situation about Lie automorphism of the matrix ring R differs greatly from 
that of Jordan automorphism. 

Definition A mapping A—^A^oiR into itself is called a Lie representation, 
if it satisfies 

(A + By = A a + B a , [A,B]° = [A° ,B a \. 

The representation is called central, if it maps into its center, that is 

\{A)I. (77) 

where A ( i 4) belongs to the center of the basic sfield 

Theorem 26 Every central representation can be expressed as 
A(i4) = 

where trd denotes the trace of A and n(a) is a central Lie representation of the sfield 
中 . 

Theorem 27 A central Lie representation of a sfield is a representation of the 
factor group of the additive group of 电 over its derived. Lie module into the center of 
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Owing to their simplicity, we omit the proofe of both theorems. 

If A —► is a Lie representation, so is 

A-*X{A)I + A a . (78) 

This very fact makes some trouble. In fact, sometimes, (78) ceases to be one-to-one. 

First of all, let us investigate the condition under which (78) is one-to-one. That 
is, to find the condition under which the equality 

\{A)I + A a =0 (79) 

implies A = O. 

Evidently, if A belongs to the center, A a belongs to the center. Conversely, 
suppose that A° belongs to the center, but A does not. There exist a matrix B such 
that C = [A, B] ^ 0. Then C is mapped into 

[A(^)/ + A a ,X(B)I + B v } = [^,^) = 0, 

which contradicts the assumption “one to one”. Therefore, if A a belongs to the center, 
so does A. 

Foom (79), we therefore deduce that A belongs to the center, say A = al, where 
a belongs to the center of FYom (79), we deduce the condition 
n/i(ot) + = 0. 

Therefore (78) is one to one, if and only if 

nn(a) + a a ^0 (80) 

for all a 0) belonging to the center of 

Theorem 28 Let ^ be a sfield with characteristic different from 2 and 3. 
Every Lie automorphism of R(n > 2) is either of the form, 

AA)I + PA T P~\ (81) 

where p(d) is a central Lie representation of $ and r is an automorphism of $ and 
n/i(a) - a T 0 for cdl 0) belonging to the center of or of the form 

A-* n{trA)I - PA r 'P~\ (82) 

where t is an anti-automorphism of $ and n^i(a) — a T / 0 /or aZZ a (# 0) belonging 
to the center of 

Proof 1) Idempotent elements. Let Ei, , E n be n elements different from 
zero such that 


Ef^Ei for l<t^ 
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and 

EiEj = EjEi = 0 for t ^ j. 

By Theorem 22, E?s are carried into 

Xil + E：, E： 2 = E；, 

where Xj belongs to the center. From Theorem 25, we have either 
= £；£； = 0 


or 

(/- E ； )(I-E；) = (/- E ； )(I-E ： )=0. 

Suppose that we have 

E ； E ； = E{El = 0 , 

but 

(I-E ； )(I-E ； )=0. 

Then, we have 

E\= ET (/ - E* 2 ) = E{{I - 玛 )(/ - 五 I) = 0 ， 

which is impossible. Therefore we can only have either 
(i) = E X jE ； = 0 for all i ^ j, 

or 

(I - £?；)(/ - E；) = (/ - E；)(I -E*) = 0 for all 

For the second case, we replace I - E* by E* we have then 
⑻丑 i — 
where 

Ef = £；；, ETE] = £■；£； =0 for j. 

Without loss of generality, after subjecting the given Lie automorphism to an 
inner automorphism, we may assume that 

Ei = Ei — (a, t ), 

where a,t = 0 except an = 1. 

2) We have 


[Ei, [Ej , P]] = pEij + qEji for i^j, 


(83) 
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where pEij denotes a matrix (a st ) with a s t = 0 except aij = p. Consider elements 
of the form (83) whose squares are zero. Then pq = 0. Therefore the matrix pEij is 
mapped into 

XI + N, N 2 = 0， A e center 
by Theorem 23. Moreover, by 1), (83) is mapped into 

p m Eij + q*Eij = XI + N. 


We deduce 

0 = N 2 = {p'Eij + q'Eji - XI) 2 , 

that is 

p* q. Ei + q*p* Ej — 2(p*Eij + q*Eji)X + A 2 / = 0. 

If A 56 0, then p* = 9 * = 0, which is impossible. For A = 0, we have p*q* = q*p" = 0. 
Consequently, a matrix of the form pE^ is mapped either into p*Eij or into q*Eji. 
Next, from 

aEi2 —♦ Q*Ei2, pEn —* P'Esi, 

we deduce 

O = (a£Ji2,/3£?i3) —* 0 *^ 32 , 


which is absurd. Therefore we have either 

aEij —► a* Eij for all t 76 j and a G $ (84) 


or 

aEij —* ex" Eji for all * ^ j and a 6 (85) 

Suppose that we have (84), then 

Eu -» 

The mapping 

[1,72, ••- ,7n]^|l.72, ••- ,7n] _1 

leaves Ei's invariant and carries ^iEu into Eu, therefore after subjecting our Lie 
automorphism to an inner automorphism we can assume 

Eu —* Eu, i = 1 ， … ， n. 

Further, let E,j —* 7 ^ then 


[E lj ,E ij ] = E lj 





•316 - 


华罗庚文集代数卷 II 


is mapped into 

[Eu,lEij\ = jEij, 

thus 7 = 1 . Further, for j = 1, the element 

[Eii, Eu] = Ei—Ei 

is mapped into 

[ 五 1< ， 7 松 1] = (Ai - 人 )/ + Ei — Ei 

in case (i). Consequently 

(7 - l)(£?x - Ei) = (Aj - Xi)I. 

Then Ai — Ai and 7 = 1 . Consequently, we have 
⑴ Ei —* XI + Ei, Eij —* Eij. 

In case (ii), it is mapped into 

= (Ai - Xi)I - (Ex - Ei), 

then 

(7 + l)(£t-^i) = (Ai-A < )/. 

Then 入 1 = 人 and 7 = —1. Consequently, En —* —En for i 〆 1. For z ^ 1, we have 
[E<i,£i] = En, 

which is mapped into 

Aj/ - Ei] — En = -Eu, 

which is impossible. 

Similarly, with (85), we have 

Ei —* \I — Ei, E\i —» En, Eji —* Eij, 

En -* -Eji for i,j ^ 1 . 

Notice that the combination of (85) and (i) is impossible. The inner automorphism 
[— 1 , 1 ， ... ， 1 , 1 , … , 1 ] carries the automorphism into one satisfying 
(ii) Ei XI — Ei, —» —Eji. 

3) Principal matrices of rank 1. 

Let P 0) be a matrix satisfying 

PEi = E X P = P, PEi = EiP = 0 for 2 < i < ra. 
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For PEi = EiP = 0 or P, we deduce from Theorem 24 with D = C 2 and C = P that 
P is mapped into 弘 J 十 P* such that 

P*Ei = EiP* = 0 or P*. 

If P*Ei — EiP* = 0 for all i, then P* = 0 which is impossible. From P m Ei — EiP"= 
尸 ' we deduce P* = aEi. Therefore we have 

aEi —* XI + a*Ex, A € center. 

If i — 1, we have 

[a£?i,£^ife] = aE lk , k ^ 1, k^i, 

but 

= 0 and [a*£<, - £^i] = 0 ， 
both axe impossible. Therefore, since n 彡 3, we have 
aEi ― * XI + a*Ei. 

Therefore, we may assume that 

aEi —* Xil + 0iEi. ( 86 ) 

4) In conclusion, we established that either 

⑴ a 氏一 + a* t Ei, aEij -* a^Eij, 1^ = 1 

or 

(ii) aEi Hi(a)I — a^Ei, aE^ -a'jEij, ljj- = 1. 

For (i), the element 

aEi - aEj = [aEy .Eji] 

is mapped into 

(/i<(a) - iij{a))I + QiiEi - ctjjEj = [aijEij, Eji] = a^Ei - a^Ej. 

We have then, since n ^ 3, 

fH{a) = <*ii = a ij = a jj' 

that is, we have 

⑴ aEi —♦ /i(a)/ + a*Ei, aEij —* 1* = 1. 

Similarly for case (ii), we have 

(ii) aEi —* — a*Ei,aEij —* —a*Eij, 1* = 1. 
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Notice that the mapping a —» q* is one-to-one, for otherwise from a^o! and 
aEi —»/i(a) ± a*Ei, a'Ei —> n(a')I ± a*Ei, 

we deduce 

(a- a')Ei -* (/ i ( q ) - n(a'))I, 

which is impossible, since the right hand side belongs to the center but the left does 
not. Now the mapping (i) (or (ii)) is decomposed into two parts: 

A-* n(trA) (87) 

and 

4 = ( 88 ) 

The second mapping is a one-to-one mapping carrying R onto itself. It carries matrices 
of rank one into matrices of rank one, since the first mapping is independent of the 
choice of , • • • , E n . 

By Theorem 1, we have consequenently either 

A- = PA T Q, (89) 

where r is an automorphism of $ or 

A* = PA r 'Q, (90) 

where r is an anti-automorphism. Since the mapping brings central elements into 

central elements, we have Q = 0P~ l where P belongs to the center. FVom (89), we 
deduce A* = 0PA T P~ x , then 

[A,BY= 0P[A, B)P~ X = [A\B m ) = H*P[A, B]P~\ 

Consequently 0 1 = 0, i.e., /3 = 0 or 1. R-om (90), we deduce /? = 0 or -1. Therefore 
we have either 

A* = PA T P~ 1 
or 

A m = PA t 'P-\ 

the theorem is now proved. 

Remark For n = 2, the theorem is not true. For example, for the field of all 
complex numbers, we have a mapping 

«i + toa bi+ib2 \ f a — id^ &i + il>2 

Cj + ic 2 di +id 2 ) \ Ci + ic 2 di — ia 2 

which is not of the type described in the theorem. 




A theorem on matrices over a sfield and its applications 


• 319 - 


References 

Hua. Anncds of Math., 1949, 50: 8-31. 

Albert. Structure of Algebras. 

Ancochea. Annals of Math., 1947, 48: 147-153. 

Kaplansky. Duke Math. Jour., 1947, 14: 521-535. 

Ancochea. Jour, fUr Math., 1942, 184: 192-198. 

Hua. Proc. of Nat. Aca. of Sc., U. S. A., 1949, 35: 386-389 

Jacobson. Duke Math. Jour., 1938, 5: 534-551 and Amer. Jour, of Math, 1941, 63： 
481-515. 

Veblen and Whitehead, Foundations of Differential Geometry. Cambridge Tracts, 1932, 
29: 12. 

Artin and Whaples. Amer. Jour, of Math., 1943, 65: 87-107. 

Hua. Science Reports of Nat. Tsinghua Univ. Series A, 1948, 5: 150-181. 


1 . 2 3 . .4 5 .6 .7 



Supplement to the paper of Dieudonne on the 
automorphisms of classical groups* 


1. Introduction 

In a previous paper, the authorf 1 】 determined the group of automorphisms of 
the symplectic group Sp n (/f) over a field K of characteristic different from two. The 
method used there applies equally well to the general linear group GL n (iC) and after 
some complicated modifications to the orthogonal group O n (K,f), with a quadratic 
form / of index v ^ 1. Professor Dieudonn4l 2 l used an entirely different approach 
and comprehensively worked out the group of automorphisms of classical groups with 
several exceptions (see the last section of his paper). It is the aim of this paper to 
give solutions of some of the problems which he left open. 

The first difficult problem cited in his paper is the determination of the group 
of automorphisms of GL2(1T), where K is & sfield. For K of characteristic different 
from two he fails to characterize the transvections. Even if he were to succeed in 
characterizing the transvections, as he does when K is of characteristic two, the notion 
of semi-automorphisms of a sfield is still an obscure point of the final result. As a by¬ 
product of the present investigation, the author' 3 ' proves that a semi-automorphism 
(in the sense of Ancochea^ 4 ' and Kaplanskyt 5 !) Q f any sfield is either an automorphism 
or an anti-automorphism. Combining this with the result of Dieudonn6, we can 
immediately solve the problem of automorphisms of GL 2 (K) (and PGL 2 (/^)), for K 
of characteristic 2. In this paper we shall solve the more difficult problem concerning 
the automorphisms of GL 2 (K) for K a sfield of characteristic different from two. 

The second problem which he left unsolved concerns the automorphisms of the 
special linear groups SL 4 (/0 and PSL 4 (if). Since the dimension is higher, the prob- 
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etry of skew-symmetric matrices. Some new types of automorphisms appear; these 
surprising phenomena throw a new light on the study of the commutator subgroup 
Cl„(K,f) of the orthogonal group, which will be a subject of our later discussion. 

It seems worthwhile to mention that the nature of such problems show that the 
lower the dimension the harder the problem. Dieudonn6 adopted a method which 
worked smoothly for large n, and treated individually the cases with small n. As the 
author mentioned before, the difficulty increases as n diminishes; Dieudomi6’s method 
becomes very clumsy for smaller n, and sometimes he is unable to solve the case for 
smallest n. On the other hand the author's method starts with the least possible n, 
which is usually the most difficult case. Therefore the reader will have little difficulty 
in extending the special results of this paper to the general case by means of the 
inductive method used in [1]. Moreover, in contrast with that of Dieudonn4, the 
author’s method uses only the calculus of matrices. 

I. Linear groups 

2. Automorphisms of the general linear group GLa(K") 

Let be a sfield of characteristic ^ 2. An element M of GL 2 (K) is said to be 
an involution if M 2 = I. Since —I is the only involution commutative with every 
element of GL#2(/0> any automorphism of GLa(/f) must leave —I invariant. 

Let A and B be two anti-commutative involutions, that is 

A 2 = B 2 = /, AB = -BA. (1) 

These are invariant relations under any automorphism. Since A and B can be simul¬ 
taneously carried into 



by a similarity relation ①， we may assume that the automorphism under consideration 
leaves both matrices of (2) unaltered. 

It is easy to verify that a matrix commutative with both matrices of (2) takes 
the form 
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where X runs over all non-zero elements of the sfield K. Therefore the automorphism 
under consideration carries the group constituted by all the matrices of the form (3) 
onto itself. 

Let Kq be the center of the sfield K. Every matrix commutative with all the 
elements of (3) belongs to GL 2 (i^o) and conversely. Therefore our automorphism 
induces an automorphism on GLa(^o) which leaves both matrices of (2) unaltered. 

The parabolic elements ① are defined in the same way as before ([1], p.756); 
namely, if the characteristic p of the sfield differs from 0, a matrix A is called parabolic 
if A 2p = I; and in case p = 0, if there are infinitely many matrices similar to A 
and commutative with A. Since SLa(i^o) is generated by all the parabolic elements 
of GL 2 (i^o)’ the automorphism under consideration induces an automorphism on 
SL 2 (iCo). It was proved in [1] that every automorphism of SL 2 (Ao) is of the form 


A-» PA a P~ 1 , 


⑷ 


where P is a non-singular matrix and a is an automorphism of the field Ko- This is 
defined for A belonging to SL 2 (/fo). 

Since 



all belong to SL 2 (AT 0 ) for A belonging to SL 2 (K。)，and our automorphism leaves (2) 
unaltered, we deduce immediately that P is either Ai" or " ( ' (1) ) ■ That is, we 
have only, two possible cases of (4) ， namely, 

A-^A° (5) 


and 



⑹ 


We are going to discuss both cases separately. For the first case we have, in 


particular, 



for x e Ko, 


⑺ 


① Notice that the definition does not coincide with the usual definition of parabolic element. 
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a 

0 



a° 

0 


(；)-) 


for o 0) 6 Kq 


and 



Notice that 



⑻ 


⑼ 


Next we are going to consider those parabolic elements of GL 2 (K) which are 
commutative with (7). By commutativity they are of the form 


since they are parabolic, we have a = 士 1. Squaring all those elements, we deduce 
that our automorphism induces the mapping 



for all x belonging to K, noticing that the definition of x a is now extended to any 
element x of K. Evidently, we have 


{x + y) c = x a + y° 


( 11 ) 


for all x and y of K. 

Since 

, x{^0)eK 

is commutative with (8), and 




(12) 
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it is mapped by our automorphism into the following form 



Comparing the elements in the (1 ， 2) position, we have 
x°x* = 1, 

that is, (ar -1 )® = (x* 7 ) -1 , and 



for all x € if. Considering the element 

(xyx 0 \ = ( x ^ \ ( y Q \ ( X 0 、 

V 0 {xyx)- 1 ) = \0 x- 1 )\0 V- 1 J\0 x- 1 )' 

we deduce that 

{xyx) a = x°y a x°. 

Evidently 1^ = 1; therefore, by a theorem of the author ⑶， we deduce that the map¬ 
ping x —» s' 7 is either an automorphism or an anti-automorphism. 

Since Sl^K) is generated by the elements of (9) and (10) (Dieudonn^l 6 !), we 
have the following two possibilities: 

1) If i —> x* 7 is an automorphism, the automorphism of SL 2 (K) under consider¬ 
ation is 

A^A°. (13) 
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e have the automorphism, after an inner automorphism, 

A^{A >a r\ 


(14) 


where A' denotes the transpose of A. 

By similar methods applied to (6), we obtain the same results (13) and (14). 
Form now on we assume that the automorphism of GLa(K) under consideration 
satisfies (13) or (14) for A belonging to SL 2 (if). We now consider GL 2 (>K"); certainly 
the most essential part is to find the mapping of those elements of the form 


0 A 


X€K. 


(15) 


Since such an element permutes with any element (12), with x 6 Kq, it is mapped 
into an element of the form 

/ a* o N 
V 0 d J' 


Since 


is mapped into 


1 0 \ / 1 °\f l 0 

0 A y \ i 1 八 0 A, 


‘ (Ax) ff 


for all x. In particular setting ; 


dxx a aj l = (A®, 
=1, we have 
d\ = X a a\, 


that is, (15) is mapped into 

n )a* = d A ( 

Combining (16) and (17), we have 


1 0 
0 \° 


X~ a 0 
0 1 t 


(16) 


(17) 


(18) 


\ a axx° = (Ax) tf aA. 
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In case <r is an automorphism, we have 

a\x° -x a ax. 


that is, a 入 belongs to the center, and (15) is mapped into 



where a 入 is a representation of the multiplicative group into the center. 
In case a is an anti-automorphism, we have 

{\°a x )x° = x a {\ a a x ), 

that is, d\ = _\*^入 belongs to the center, and (15) is mapped into 



Therefore we have 

Theorem 1 Every automorphism ofGL 2 (K) is a product of an inner auto¬ 
morphism and one of the following two types: (i) 

x ( 取， （19) 

where a is an automorphism of K and x(^4) is a representation of GL 2 (K) into the 
multiplicative group of Kq\ and (ii) 

A — X ⑷ (A，- 1 )' (20) 


where a is an anti-automorphism of K. 

Remark The theorem also holds for sfield of characteristic 2. 

3. Automorphisms of SL^(Jif) 

We now consider a group SL$ (K) which differs slightly from SL2(/0. The 
projective geometry on a line over a sfield suggests that the group SL^ (K), obtained 
by successive application of harmonic separation, is of primary interest (Ancochea 171 ). 
As we know, in the group SL 2 (《〉，involutions do not always exist. More precisely, if 
-1 does not belong to the commutator subgroup of the group of multiplication, there 
does not exist an involution (a relation obtained by harmonic conjugacy). 



Supplement to the paper of Dieudonne on the automorphisms of classical groups - 327 _ 


By an easy investigation, we find that if —1 belongs to the commutator subgroup 
of the multiplicative group of K, then SL^(lf) = SL2(^0; otherwise, SL 圭 (Jf) is 
obtained from SL 2 (K) by adjoining an involution, such as 

1 0 
0 -1 

The automorphisms of SL 2 (K) are still to be determined when —1 is not in the 
commutator subgroup. 

The theorem which we shall prove in this section is the following: 

Theorem 2 Every automrophism ofSLf(K) is either of the form 

X {A)PA a P~\ (21) 

where P is a non singular matrix, tr is an automorphism of K, and x(^) a repre¬ 
sentation ofSL^K) into the multiplicative group of the center Kq, or of the form, 

x {A)PA' v - l p-\ (22) 


where a is an anti-automorphism. 

Remark Concerning X ⑷： in case —1 belongs to the commutator subgroup 
of the multiplicative group of K, then we have always x(^) = 1- Otherwise we have 
another possibility, namely 

1, if A belongs to SL2(/0 ， 

—1 ， otherwise. 

Proof of Theorem 2 Let us start with a pair of anti-commutative involutions 
A and B, that is 

A 2 = B 2 = /, AB = -BA. 

There exists a matrix P, which may not belong to SL* (K) such that 

PAP- 1 = ( 1 ° V PBP- 1 = ( °^ I 

\0 -1 J V 1 0 



An element of SLj (if) commuting with both matrices of (2) is of the form 





(23) 
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where either X 2 or —A 2 belongs to the commutator subgroup of the group of multipli¬ 
cation of K. By Theorem 4 of Hua! 8 】，every element commutative with all A belongs 
to the center Kq. Therefore a matrix commutative with ail elements of (23) is of the 
form 



where a, b, c, d belong to Ko, and the determinant of (24) (or its negative) belongs 
to the intersection of the center and the commutator subgroup of the multiplicative 
group. The remaining part of the proof is similar to that of Theorem 1. 

4. Automorphisms of PGL] (1C) 

We identify all the elements of the form 7 M of GL 2 (/T) as a single element 
of PGL 2 (A"), where 7 (/ 0 ) runs over all the elements of the center. A matrix M 
represents an involution if it satisfies 

M 2 = 7 /, -yeK 0 (25) 


and if M does not represent the identity of PGL 2 (/T). 7 is called the scalar of the 
matrix M. 

Since we have a matrix Q such that 


QMQ~ 


0 1 
c d , 


therefore an involution satisfying (25) is similar to the normal form 

/ 0 1 、 


(26) 


(27) 


If 


0 

7 



)are similar in PGL 2 (K), then from 


(: 





0 


where e belongs to the center, we deduce 


c = e6b, d = ea, ja = eSb, 76 = ec; 


consequently 


(7 - e 2 6)b = (7 - e 2 6)a — 0 . 
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Since a and b cannot be both zero, we have 7 = e 2 6. Conversely, we have 

(：：)-(o ：)(：：)(：；.)■ 

Therefore two involutions, represented by matrices with scalar 7 and 6, are similar in 
PGL 2 (if) if and only if 75 -1 is the square of a central element. 

Therefore, all the involutions can be classified into conjugate sets r Q , where a 
runs over all the elements of the factor group of the multiplicative group of the center 
by its subgroup of all square elements. 

We are now going to prove that the automorphism under consideration leaves T 1 
invariant. Fi is a conjugate set containing two distince commuting elements, namely 



It is enough to distinguish ri from those T Q containing two commutative involutions. 
Let r Q be a set containing two commuting involutions A and B with scalars a and 
a6 2 . FVom AB = eBA, e belonging to the center, we deduce 

BA 2 = Ba = aB = A 2 B = e 2 BA 2 , (28) 


that is, £ = ±1, Therefore {AB) 2 = ±a i S 2 I, so that AB belongs either toTi and r_i. 
Therefore our automorphism either leaves both r! and r_i invariant, or permutes 
them. 

1 °. If -1 is the square of a central element, then r_i coincides with Tj. 

2°. Suppose that —1 is not a square of a central element but is a square of an 
element o in K. The product of any two distinct commuting elements of belongs 

to r_i. In fact, let us fix one of them as >1 = ( : ' then, from AB = BA, we 

deduce that B is either the identity or ■yA, 7 belonging to center. Thus, we always 
have AB = -BA (the existence of B is shown by (2)), and (AB ) 2 = _/• In T_i we 
have two commuting elements 


whose product is 




a 

0 



0 
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which belongs to r_i. Therefore our automorphism cannot interchange Ti and r_i. 

3°. Next we suppose that —1 is not the square of an element in K. Let A and B 
be involutions of Ti and r_i respectively, and let them commute. We may fix A to 


be 


0 


).Then B takes one of the forms ： 



)• However, 


the first is impossible, since A 2 = —1 is insoluble. Therefore we have 


B 


0 

-A - 1 


A 

0 


The product AB belongs to Ti. Therefore the automorphism under consideration 
cannot interchange Ti and r_i. 

Take a pair of commuting involutions of rV Without loss of generality, we 
assume that the automorphism leaves (2) invariant. The elements commuting with 
both matrices of (2) are of the form 



by (28). Squaring all these elements, we obtain 


(o 




(29) 


(30) 


Consider the set of elements commuting with all these elements, that is 






(31) 


where p belongs to the center and may depend on a. Suppose p 一 0; then from 
pa 2 = pa 2 p, 

we have 
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that is 

(1 9P' 1 0 ) = ( a2 1 9P _1 \ 

y rp _1 sp -1 y y 0 o 2 ) \ 0 o 2 J y rp~ l sp ~ 1 J 

By Theorem 6 of Hua( 8 】，gp _1 , Tp~ l and ap~ l all belong to the center. By similarly 
treating the case p = 0, g 一 0, we reach the conclusion that 



where M belongs to PGL 2 (Ko)，and Aa 2 A -I a ~ 2 belongs to the center for all a, by 
(31). 

Consider those elements of the form 

(AM)(mN) 2 ( 入 i € K 0 , 

where M and N belong to PGLa(ifo)- The elements MN 2 M~ l N ~ 2 generate the 
group PSL 2 (/Co); in fact, they generate a normal subgroup of PSL 2 (if 0 )，but since 
PGL 2 (/fo) is simple ①， we have the assertion. Therefore the automorphism of PSL 2 (/^) 
induces an automorphism on PSL 2 (ifo). 

Now we use non-horaogeneous representations: an element of PGL/ 2 (^ 0 ，repre¬ 
sented by a matrix ( ° ), can be realized as a transformation 

\ c d J 

zi = (az + b)(cz + d )' 1 (32) 

on a projective line over the sfield K. It is evident that the realization is one-to-one. 

We define a parabolic element t by the following properties: if the characteristic 
p of our sfield is not zero, a transformation t is called parabolic, if t p equals the 
identity of PGL 2 (iO; and, for p = 0 , a transformation t is called parabolic, if there 
are infinitely many elements similar to t and commutative with t. We shall now prove 
that every parabolic element of PGL 2 (/^o) 從 11 be carried into the form 

Z\ = z + o (where a belongs to Ko). 

In fact, suppose p = 0, and that t is represented by a matrix T belonging to SL 2 (iCo)， 
where 
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Since 7^ belongs to SL 2 (ijTo)，we have a = ±1. As in [1], we have our assertion. In 
case p = 0, the argument of [1] can still be applied. 

After we have talcen care of the parabolic elements, we can follow either the 
method of Schreier-van der Waerden^ or the method of the author ⑴ to obtain the 
automorphism of PSI^K。). More precisely, we may assume that our automorphism 
of PGL 2 (K)，induces an automorphism of PSL 2(/0 which carries the transformations 

zi = z +1 to zi = z +1°, t € Kq (33) 

and 

Z\ = —1/z to Z\ = — 1/z, (34) 

where ^ is an automorphism of the field Ko- 

Consider all the parabolic elements commuting with the translation group (33). 
It is easy to see that they are of the form 

zy = z +1, t £ K. (35) 

Therefore, our automorphism carries (35) into 

Zi = z + t v , 

where t a is defined for all t belonging to K. From the identity (12), we deduce by 
the same argument following (12), that t a is either an automorphism or an anti¬ 
automorphism of the sBeld, and 

Theorem 3 Every automorphism ofPGLiiK) is induced by an automorphism 
o}GL 2 {K). 

Remark The theorem is still true for characteristic two. 

5. Automorphisms of PSL^K - ) 

As in §4, we classify the involutions into conjugate sets; the situation now be¬ 
comes very much more complicated, however, since neither can the scalar be arbitrary, 
nor dose the scalar determine a unique conjugate set. It is not always possible to find 
the matrix Q, belonging to SL^(/f), such that (26) is true. A matrix of scalar 7 is 
similar to a matrix of the form 


0 q~ 
79 0 


(36) 
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under SL 2 (AT)- A conjugate set r Q of §4 may now break up into several conjugate 
sets. Nevertheless, the same argument gives us the result that the pair of sets T i and 
r_i is invariant under automorphisms. 

Fortunately Ti does not split into several conjugate sets; in fact, 

1 0 
0 -1 


and . 

1 0 
Q 1 

which belongs to SL 2 (/ 0 . 

As in §4, we divide our treatment into three cases: 1° and 3° can be treated 
analogously. The only difficult case is where (2°) - 1 is a square of a non-central 
element a. Now r_i splits into several conjugate sets. We shall now prove that every 
involution (namely (36) with 7 = —1) of r_i is similar to 

° i ) (37) 

0 -qaq ) 





under the group SL 2 (^ 0 . In fact this is a consequence of the following simple calcu¬ 


lation: 

a 0 \ 

0 -a / ' 
(38) 

a ： )( ： f)(-^)(：：r 



Then (36) with 7 = —1 is similar to (37) under SL 2 W，since 


belongs to SL^/C). Putting qa instead of q in (36), we find that 


( a 0 

and 

( 0 ㈣ - 1 ) 

(39) 

V 0 -qaq -1 J 

> 

V -ga 0 J 



are similar under the group SL 2 (K). Their product is 

(-/：)• 
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which is similar to (39) too. Therefore any set 7 contains three mutually commuting 
elements. 

As we proved beofre (§4), the product of two commuting elements of Ti is not 
an element of IV 

Therefore the automorphism carries Ti onto itself. By an argument similar to 
that of §4, we may assume that our automorphism leaves 

(；：)'(-) 

invariant. 

The elements of PSL 2 (K) commuting with both these elements are of the form 



Squaring all of these elements, we obtain 

(； 2 ：0- 

Since they belong to PSL〆/^)，then a 2 or -a 2 belongs to the commutator subgroup. 
Using the lemma below, we proceed as in §4, and obtain 

Theorem 4 Every automorphism o/PSL^K) ia induced by an automorphism 
o/SLf(K). 

Lemma Let C be the commutator subgroup of the multiplicative group of K 
and let C 2 be the group generated by all the square elements of C. Then either K is 
commutative, or the sfield K is generated by all the elements of C 2 . Consequently, if 
an element commutes with all the elements of , then it belongs to center. 

Proof If C 2 is not contained in the center, the lemma follows from Theorem 
1 of Huai 8 ]. 

Now we suppose C 2 is contained in the center Kq and K is not commutative. 
Let 

1,01,02,- ■ - ) «< € C 

be the representative system of the factor group of C over C 2 . Since (7 generates the 
sfield, by Theorems 4 and 1 of Hua 阆 ， we have therefore 

K = Ko(a\,02, - ■ - )t 
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o? = 7i» ( 0 * 0^) 2 = yajSij 

and 7 ’s and 6’s belong to the field Kq. 

Since K is not a field, there is a ^ 1. we may suppose that 5。 = 5 一 1, 
From 

01(02 + 1) = 5(^2 + l)oi + ai(l — 6 ), 


ai (02 + l)oi' 1 (o2 + l) -1 = ^ + (02 + 1) *(1 — <^)* 
Suppose that the squares of commutators belong to the center. Then 


{S + (a 2 + ir 1 {l-6)) 2 = 0, 


where (3 belongs to Kq. Consequently 


(5o 2 + l ) 2 = 0(a 2 + l) 2 , 


we deduce immediately 


(02 + l)oi = jai (02 + 1) + Oi ( 


we deduce similarly 


Consequently 5 = —1 and 72 = —1. Similarly 71 = —1. 

Suppose that Ko has more than three elements (it is easy to prove that 

- ,there does not exits such a K), Take x of Ko different 


, ., 1 — x 2 2x 

= (i-xa 2 )(i+xo 2 r = _ r+^ 2 a： 



s is a contradiction. We have therefore the lemma. 
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6. Automorphism of SL 4 (i^) 

The case left open by Dieudonne concerning the automorphism of SL< 4 (iC) is that 
where K is not commutative, K has characteristic 一 2 and —1 is not contained in 
the commutator subgroup of the multiplicative group. 
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(Ki = satisfies 

K2J2 = JiK^ K2J3 = —J1K2, 哈 =—J2 

and K 3 = Kf satisfies 

K 3 J 3 = J 3 K 3 , K 3 Jx = -JaATa, K\ = -J 3 ). 


(44) 


(45) 


The subgroup commuting with J\ consists of all elements of the form 



where A and B are two-rowed matrices. Consider the group G formed by 



(46) 


(47) 


Since all of the elements of the form A [ ) A -1 generate (Dieudonnfi^) the 

V - 1 °/ ,, 

group SL 2 (iiO， 』 the elements of the form (47) generate a group formed by all 
elements of the form 

where A runs over all elements of SL 2 (K"). Squaring all of the elements of (48), we 
obtain 

(0 ；)■ 
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Since SL 2 (if) is also generated by all of its square elements (Dieudonn^^) it follows 
that our automorphism induces an automorphism on the subgroup of elements 

(“)’ 

where A runs over all the elements of SL 2 (/T). 

If we also use J 3 , our automorphism induces an automorphism on 


where A runs over all the elements of SL*(K). Notice that the integer l is uniquely 
determined by A. More precisely, in case —1 belongs to the commutator subgroup of 
the multiplicative group, then l is always even, otherwise l is even for A € SL 2 (iO 
and l is odd for A ^ SL 2 (i^). 

By Theorem 2, our automorphism gives us the following mappings: 


A 0\ ( PA°P~ 1 0 ' 

.。小 l 

where <r is an automorphism of K or 

,0 ) ( PA' a ~ l P- 1 

0 I )^{ 0 , 

where <r is an anti-automoiphism of K, and where A runs over all the elements of 
SL2(/0. Consequently our automorphism cannot permute and iff 1 , and must 
keep invariant. By an inner automorphism, keeping invariant, we may assume 
that our automorphism induces either a mapping 


(49) 


(50) 


H 


A' a - 

0 


(51) 


(52) 


Using J 2 and K-i instead of J\ and Ki, we can prove that our automorphism 
keeps K 2 invariant, and similarly . Since SL 4 (/f) is generated by , K 3 
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and 

( 1 a: 0 0 ^ 

0 10 0 

0 0 10 ' 

0 0 0 1 / 

we have the following theorem: 

Theorem 5 Every automorphism ofSL^K) is the restriction of an automor¬ 
phism of GLi(K). 

Since there is no essential difficulty in the case of PSL 4 (/f)，the author will not 
discuss it here. 


II. Orthogonal groups 

The case of the orthogonal group left open by Dieudonn^ is (K, f) , where K 
is a field of characteristic ^ 2 and / is a quadratic form of index 2. 

7. Preliminaries 

Theorem 6 (Witt【 10 】） Every quaternary quadratic form of index 2 is equiva¬ 
lent to 

f = *1*3 + X2X4, 

whose matrix is 

5 

where I and 0 are two-rowed identity and zero matrices. 

FVora now on, we assume that our fundamental quadratic form has the matrix 

3- 

A four-rowed matrix % satisfying 

1ST = 5 (54) 

with determinant of 1 equal to 1, is an element of Oj (K). If we put 

:)， A = etC '' ( 55 ) 

then we have 

AB' + BA' = 0, CD 7 十 iXT = 0, AD' + BC' = I. (56) 
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Since T’ is also orthogonal, we deduce 

^0 + 0^ = 0, B'D + D'B = 0, A'D + C'B = I. (57) 

Later we shall need another group containing as its subgroup and which is 
defined by the matrices satisfying 


VW = o-S, 


(58) 


where a is an element 0) of the field. This group is denoted by GO 4 = GO^K, /). 

Theorem 7 If% belongs to then in the expression (55), A is either zero 
or non-singular. 

Proof Suppose that is of rank 1, we have two matrices P(= 尸 ( 2 )) and 
Q (= (J( 2 )) such that 

， O' 


PAQ 


ill] 


( p 

1 and 

( Q \) 

V 0 ^ y 

1 

V 0 cr l ) 


belongs to O 4 , without loss of generality we may consider those T with A = 
From (56), AB' is skew symmetric so we may deduce that 


10 
0 0 , 


and from (57) 


Again from (56) 


in conclusion, we have 


0 62 

、0 v 


0 0 
,c 3 C4 , 


649^0 


,& 4 C 4 = 1; 




Its determinant is —& 4 C 4 = —1. This proves Theorem 2. 

Remark The converse of the theorem is also true, i.e” any T satisfying (54) 
with non-singular A must belong to Oj, This is an easy consequence of Theorem 8 . 

Theorem 8 Write /f =( 。丄 )• Every element of O: can be expressed 

V _1 0 / 

uniquely as one of the following forms: 

fV W M yAK , f (59) 

V xK I )\0 乂 , - 1 八 0 I ) V xKA ^-xyd{A))A'~ l ) 

and 



where x, y and z are elements of K and A is a non-singular two-rowed matrix. 
Proof In case A is non-singular, from (56) and (57) we deduce 

B = yAK, C = xKA 

and 

D = xyKAK + A ,_1 = (1 - xyd{A))A'~\ 

since AKA' = d(A)K. The case = 0 can easily be expressed as (60). 

The uniqueness can also be proved without any difficulty. 

Definition We use 

(s,i4,y), { 00 , A, z) 

to denote the matrices of (59) and (60) respectively, which may be called the coordi¬ 
nates of an element of O 4 . 

8. An extra type of automorphisms 

From our previous experience, we might expect that the only automorphism is 

where 爭 belongs to G0 4 (K). But this is entirely false; there are many automorphisms 
of O^(K) which are not those of O^K). 


① We use d(A) to denote the determinant of A. 
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Let a —♦ a T be an automorphism of the field. Let ip(a) be a representation of the 
multiplicative group of the field into itself, and let it satisfy 

^ 2 (a) = (61) 

This is called a representation induced by the automorphism. 

Example 1 Let K be a rational field, The identity automorphism is the only 
one. We write a rational number as 


r = ±2 i *3 Ia - - 必 . 


where p„ denotes the n-th prime, and the l n are integers. We fix a prime p n . Then 
we define 

， (r) = (-”*"• 

This evidently satisfies (61). There is exactly one other induced representation 


<P(r )= 


1 , if r>0, 
-1, if r<0. 


Let be a real field. The induced representation 

<^(o)= 


f 1, if a > 
\ -1, if a < 


is the only one. 

Example 3 


Let K be & complex field, and a T be the conjugate of a. Then 

•P(o) = M/a 


is an induced mapping. Since 

荅 = ^ = 化⑷ ) 2 ， 

this does not exhaust all of the induced representations of the complex number field, 
which contains infinitely many automorphisms. 

Now we express the previous notion in matrix form. We define 

X (A) = ^(det A). (62) 


Theorem 9 The mapping 


(x,A,y) -> {x T ,x{A)A,y T ) 


(63) 
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defines an automorphism of O^, where is related to r by (62) and (61). 

Proof The theorem follows from Theorem 8, if we can prove that (63) carries 
the following identities into their corresponding ones: 

(0,/,y)(*i,/,0) = (xi(l - yxi) -1 , (1 - yxi)I, y(l - yxi) -1 ), if x x y - 1, (64) 

= (oo.xi/r.-xi -1 ), (65) 

(0, J ， ;c)(oo ， / ， 0) = (—x~ 1 t xK, —® -1 ) (66) 

and 

(0M,0)(oo > / I 0) = (oo,^~ 1 ,0). (67) 

To verify (64), we need to prove that 

(yl(l- y T x[)-\{\ - y r x-[)I,y r {l- » T *I) _1 ) 

=((*i(i j - 1 ) T ,x((i - y*i)^)(i - yxi)i, (y(i - y*i) -1 ) T ). 

The first and the last terms are evidently equal to the corresponding ones, by the 
definition of an automorphism. The middle terms are equal, since 

x((i - yxi)t) = v((i - y*0 2 ) = (v(i - »*i)) 2 = (i - » T *I)/(i - »*i)* 

To verify (65), we need only establish that 

x\K = \{x\K)x\K. 

This is also true, since 

x{x\K) = = x\fx x - 

The other two can be proved similarly. 

9. Structure of the group O 4 (K, f) 

Theorem 10 The commutator subgroup (2^ (K, /) of (AT, /) consists of 
those elements 

where the determinants of A, B, C, D are square elements of the field. 

Proof Notice that from the expressions (59) and (60) we deduce easily that if 
one of the determinants of A,B,C,D is & square element different from zero, then all 
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the others are squares. By means of (59) and (60)，we can easily verify that all the 
elements of (68) form a normal subgroup. 

The factor group of (K, f) by (68) is generated l>y 



0 

0 

0 


0 0 
入 0 
0 1 
0 0 



and 



and it is abelian, since 


r 1 0 0 0 
0 A 0 0 

0 0 10 
\ 0 0 0 A -1 




0 \ 

:(“) 


belongs to (68). Therefore n}(lf，/) is contained in (68), by the property of commu- 
stor subgroup. 

Next we are going to prove that the subgroup (68) is generated by 


0 




(69) 


and 


where A belongs to SL2(/0. In fact, the group generated by (69) and (70) evidently 




Since a matrix Q of determinant q 2 can be expressed as -Q ql, the group generated 
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by (69) and (70) also contains 


f Q 0 V 

_ (V 0 ) 

0 、 

V 0 O'- 1 ) 

( 0 J 



(72) 


Evidently (69), (71) and (72) generate the group formed by (68). Therefore (69) and 
(70) generate the group (68). 

The group (K, f) contains 


M 0 ) 

if B 0 ) 

j a ° Y 

7 B 0 ) 

VO A'- 1 ) 

1 V 0 B'~ l ) 

V 0 A'~ l ) 

V 0 B'- 1 ) 


so it contains (70). Since 


(rl 0 \ 


/ rl 0 、 

r 1 ^ 叶 1 _ 

(I «(r 2 - l)K \ 


U i) 

\0 


"V° 1 ) 


contains the first term of (69); similarly it contains I 


yK 





-K 

0 



0 


therefore /) contains (69) and (70), and consequently if contmns the group 

(68). The theorem follows. 

Theorem 11 The group PQ^K,/) is a direct product of two groups each 
of which is isomorphic to PSL 2 (AT). More precisely, can be expressed, as 

a direct product of three irreducible components: one formed by the identity and its 
negation, and two being isomorphic to PSL 2 (>K"). 

Proof The two subgroups are the one formed by (70) and the one generated 
by (69). More precisely, we may write the second as 


(ql rK \ 

\ -sK tl ) 


qt — rs = 1. 


(73) 


The correspondence of (73) and ( 9 I established the isomorphism of the group 

V 4 / 

(73) and SL 2 (/f). 
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The intersection of the groups (69) and (73) is given by 


which is the identity of /). Moreover, for determinant of A equal to q 2 , we 

have 


A xAK 

yKA (1 - xyd(A))A'' 




ql qxK 

qyK (1 - xyq 2 )q~ l I y 


( 0 A \ 

(-AK 0 \ 

( 0 

(f 1 zA'~ l K j = 

V giAK)'- 1 } 



Theorem 11 follows from the definition of the direct product. 

Theorem 12 Every automorphism o/O^K, f) is a combination of 


(74) 


where D belongs to GO\ and a is an automorphism and that given by (63). Those of 
PO^(K,f) are induced from the automorphisms of 0^(K, /). 

Proof Since (or is a characteristic invariant subgroup, 

an automorphism of 0 + {K,f) (or PO^) induces an automorphism on (or 

PCl^). Since (K, f) is a direct product of three groups Gq, G\ and (?2, where Gi 
and G 2 denote the subgroups of PO 文 (K 、 f) represented by (70) and (73), and since 
these axe isomorphic to PSL 2 (if)，it follows that the automorphism of ^4(^,/) can 
be obtained by those of Gi and G2 and by adjoining one which permutes G\ and G 2 
(since Go has only identity automorphism). 

fab 

Since we have, by putting A = 

\ c d 


/ 1 

0 

0 

0 、 



( 1 

0 

0 

0 、 



0 

0 

0 

1 

(A 

B \ 

0 

0 

0 

1 

= ( aI 

bK \ 

0 

0 

1 

0 

\C 

D) 

0 

0 

1 

0 

\ -cK 

dI ) 

V o 

1 

0 

0 J 



(0 

1 

0 

0 ) 




therefore, the permutation of the two components of Gi and G2 is merely an auto¬ 
morphism induced by an element of 0 4 (K,f). Hence after an automorphism of the 
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type (74) if necessary, we may assume that the automorphism of nj (K, f) induces 
automorphisms on Gi and G 2 , say 



and 


( qI 

rK )^±( 


nK \ 

卜 / 

rK \( 

qii 

riK V 

\ -sK 

^ ) \ 

-siK 

til ) 

、 -sK 

tl ) \ 

-SiK 

hK ] 


(76) 

where a and t axe two automorphsims of the field. Squaring the elements on both 
sides of (74) and (76), and since SL2(AT) is generated by all its square elements, we 
may omit the ± signs in (75) and (76). 

We shall now construct a matrix I belonging to GOt{K,f) which effects both 
(74) and (75) at the seune time. In fact, let the determinant of Q be A, and write 

R=\ 1 ° ) Q. Then 

VO W 
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and 





( 

Qil 

rxK \ 

if 

9 / 

rK \ 




-siK 

hi ) 

'l- 

-sK 

) 

/ 1 

0 

0 

0 ) 



( Qi 

0 

0 

0 

入 

0 

o 1 


and 

0 

9i 

-n 

0 

0 

A 

0 



0 


ti 

\0 

0 

0 

ij 




0 

0 


<hl 

-SiK 


riK 


evidently commute. Therefore, after an automorphism of the type (74), we may 
assume that our automorphism under consideration induces the mapping 






(77) 


and 


卜 / 

rK ) 


t t K \ 

\ -sK 


\ -S T K 

) 


(78) 


If K contains more than three elements, we have the following particular case of 


(78): 

(al 。 _ 广 M 。 

L 0 广 \ 0 

Thus elements which commute with (79) are of the form 


) ， a T — ±1. 


(79) 





(80) 
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by Theorem 1, we have 


/ p 0 K ^ p ) p,t ° 

\ o p 7-1 y \ o 

from (77), we have = 1, and from (78), 

xW = q T /q- 


Consequently 




x(qI) = q t /q 


and 

(x(^)) 2 = detX T /deti4. 


Our theorem is proved for K containing more than three elements (certainly a 
little detail is needed to complete the proofe of and PO^). In case K contains 
only three elements, the factor group O4 is of order two, and it is represented by 



Therefore we need only find the mapping of ll. ii is mapped to ilip, where 屯 belongs 
to fiU _ There are two ways to map the element 


( A 

卜 / 

rK )^ = ( 

V° A， J 

\ -sK 

^ ) \ 


il 

they give us the identity 

^{ A \ 

VO A r ~ ] 


)( 


q r I r T K 
-s T K t T I 


UAU~ l 

0 


V _1 u _1 : 


0 

W <7/ ~rK \ 


J\sK tl ) 

(UAU- 1 

0 \ 

= l 0 

(UAU- 1 )' -1 ) 


-t t K \ 

V s T K 

^ ) 


for all possible A,q,r,s and t. It follows that is either an identity or its negation, 
and we obtain the same conclusion. 
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Automorphisms of the unimodular group* 


Notation Let £0T„ denote the group of n x n integral matrices of determi- 
nant ±1 (the unimodular group). By we denote that subset of fJJln where the 
deferminant is +1; 9Jl~ is correspondingly defined. Let /(") (or briefly /) be the 
identity matrix in fJJln, and let X' represent the transpose of X. The direct sum of 
the matrices A and B will be represented by A -j- 

A = B 


will mean that A is similar to B. In this paper, we shall find explicitly the generators 
of the group 14, of all automorphisms of VJln- 

1. The commutator subgroup of 

The following result is useful, and is of independent interest. 

Theorem 1 Let be the commutator subgroup of 2Jl„. Then trivially C 
9JIJ. For n = 2,9K n is of index 2 in 971+, while /or n > 2, £Rn = 9JI+. 

Proof Consider first the case where n = 2. Define 



It is well known that S and T generate 取 }. An element X of OTj is called even if, 
when X is expressed as a product of powers of S and T, the sum of the exponents 
is even; otherwise, X is called odd. Since all relations satisfied by S and T are 
consequences of 

5 2 = -I, (ST) 3 = I, 
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if X e SDtg , X and J~ l XJ are of the same parity. This is easily verified iox X = S 
or T; since S and T generate 9Kj, and J^XxX^J = J~^X\J • J~^X 2 J, the result 


On the other hand we can show that 0 C W 2 . For, 0 is generated by T 2 and 
ST, since TS = {ST- T~ 2 ) 2 . However, T 7 = TJT~ l J l £ 5K 2 , and therefore also 
(T')~ 2 € iRa- Furthermore, ST = TST^S- 1 ^)-^ 2 E fR 2 - This completes the 
proof for n = 2. 

Suppose now that n > 2, and define 


(the symbols S and T defined here are the analogues in 9H+ of those defined by (1). 
It will be clear from the context which are meant). For n > 2, we have® 


T' = [R-^TR)-^-^RiTR^-^iTRy^RiTR)-^-^R~ l {TR) n ~ 2 ](TR) 6 


Further S = TST^S^fT^T € W„. Finally, for odd n there exists a permutation 
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Set iV = Ja + d|. If JV ^ 3, consider the elements generated by S and T T (mod N). 
Since a + d = 0 (mod N), we find that (7 ^) 2 = 7 (mod N). Furthermore (5r r ) 3 = I 
(mod N)] therefore S and T 7 generate (mod N) at most the 12 elements 
±7, ±S, iT, iST 1- , 

But if r is an automorphism, S and T T generate which has more than 12 elements 

(mod N) for JV > 3. 

Therefore N ^ 2. Since ad > 0, either a = d= lora = d= —1, and thence 
b = l, c = 0orb = 0, c = —1. There are 4 possibilities for T^: 


T 0 = 

( l M, 

T 2 = 


V 0 1 / 


1^=1 




卜 

(-11) 

, r 3 = 

U -0 


Since S and T generate SJlJ, to determine t it is sufficient to specify S r and T T . Thus 
every automorphism of is of the form S —► BSB -1 , T —» BTiB~ l (for some i, 
i = 0,1,2,3), where B € If J is given by (2), we have 

T 0 = T, Ti = STS~\ T 2 = -JTJ~ l , T 3 =-SJTJ^S- 1 , 

and also S = —JSJ~ 1 . The possible automorphisms are: 
i = 0 ： S — BSB- 1 ，T BTB- 1 ; 
i = 1 ： S -y BS-SH 1 ，T BS T S~ l B~ l \ 
i = 2 : -BJ S T^-BJ T J- l B~ l \ 

i = 3 ： S-^-BSJSJ^S^B- 1 , : r-f-BS^/rJ-iS-W^- 
These automorphisms are of two types: for i = 0 and 1, S —» ASA -1 , T —* ATA~ l , 
which imply that X 6 —* AXA~ l ; for i = 2 and 3, S —* —ASA~ l , T -* 

-ATA- 1 , which imply that X -* e(X) ■ AXA~ l . This completes the proof. 

3. Automorphisms of and 9Jl„ 

We are now faced with the problem of determining the automorphisms of OT 2 
from those of We shall have the same problem for and 37t^. As we shall see, 
the passage from to ®l„ is trivial, and most of the difficulty lies in determining 
the automorphisms of In this paper we shall prove the following results: 

Theorem 3 For n > 2, the group of those automorphisms of which are 
induced by automorphisms o/9Jln is generated by 
(i) the set of all “inner” automorphisms 
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^f€£Dl+ -> AX A- 1 (A e Win) 

and 

(ii) the automorphism 

尤 — n 

Remark When n = 2, the automorphism (ii) is the same as X —+ SXS~ y , 
hence is included in (i). The automorphism X —♦ g(X) - X occurs only for n = 2. 
Furthermore, for odd n all automorphisms of 3JI+ are induced by automorphisms of 

Theorem 4 The generators ojiXn are 

(i) the set of all inner automorphisms 

X em„ — AXA- 1 (Ae OT„); 

(ii) the automorphism X e 切 In —* X'~ x \ 

(iii) for even n only, the automorphism 

X e sm n — (detX) - X 

and 

(iv) for n = 2 only, the automorphism 

Xem^ -y e(X) - X, Xem^ e(JX) - X, 
where J is given by (2). 

Further, when n = 2, the automorphism (ii) may be omitted from this list. 

Let us show that Theorem 4 is a simple consequence of Theorem 3. Let r be any 
automorphism of 9Jt„- By Corollary 1, r induces an automorphism on which, by 
Theorems 2 and 3, can be written as: 

a(X). AX*A~\ 

where A € ^^(X) = 1 for all X or q(X) = e(X) for all X (this can occur only 
when n = 2), and where either X* = X for all X or X" = X'~ l for all X. 

Let Y and Z € 9ER~; then 

Y r Z T = (YZ) r = a(YZ) - A(YZ) 9 A^, 

whence 

Y T =a(YZ)-AY m Z* A- 1 (Z r )~\ 

Let Z € be fixed; then 

Y T = a(yZ) - AY m B for aU K G 
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where A and B are independent of Y. But then 

AY*B- AVB = (Y T ) 2 = (Y 2 ) T = a{Y 2 )A(Y 2 y A~ x , 

so that 

{BA)Y m (BA) = a(Y 2 )Y\ 

Since this is valid for all Y E we see that of necessity a(y 2 ) = 1 for all Y, 
and BA = ±/. This shows that either Y T = a(YZ) - AY*A~ 1 for all K e 9Jl~, or 
Y T = ~a(YZ) - AY m A~ l for all K € m;. If n = 2 and a(VZ) = e(YZ), it is trivial 
to verify that either e(YZ) = £(JY) for all K 6 or e(YZ) = —e(JY) for all 

vesm^. 

The remainder of the paper will be concerned with proving Theorem 3. 

4. Canonical forms for involutions 


In the proof of Theorem 3 we shall use certain canonical forms of involutions 
under similarity transformations. 

Lemma 1 Under a similarity transformation, every involution X e OTl n such 
that X 3 = can be brought into the form 

W(x, y,z) = L+-- +L+(-i) (w 4/ (i) , (4) 

(z xterms) 

where 2x + y + z = n and 

L 

Proof We prove first, by induction on n, that every X G fW n satisfying X 2 = I 
is similar to a matrix of the form 



/ /« 0 ) 
M -I ( n _ 0 ) 


⑼ 


For n = 1 and 2, this is trivial. Let the theorem be proved for n, and assume that 
X 2 = J( n+1 ), where n > 2. Then X 2 -1 = 0, or (X - I)(X + I) = 0. If X - J is 
nonsingular, then X = -I and the result is obvious. Hence, supposing that X - I 
is singular (so that 入 =1 is a characteristic root of X), there exists a primitive 
column vector t = ,tn+iY with integral elements such that t'X = t'. Choose 

P e 97ln+i with first row t'. Then 


pxp- 1 = 



n' 

Xx 
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Supposing this done, secondly put A = I,D = I\ we find that M = M + 2C. Since 
C is arbitrary, we can bring M into the form 

/ /( fc ) 0 \ 

I 0 oj’ 

where k is the rank of M. Since we can interchange two rows and simultaneously 
interchange the corresponding columns by means of a similarity transformation, the 
lemma follows. 

It is easily seen that 

W(x,y, z) = W(x, y,z) 


only when x = x, y = y, and z = z. Furthermore, changing the order of terms ia the 




A nonsimilar 



After a suitable inner automorphkm, we may assume that either Ii —»Ii or Ii 
We shall show that this latter case is impossible by considering the normalizer j 
of I\ and I 2 . The normalizer group of I\, that is, the group of matrices 6 OT 3 
commute with Ii consists of all elements of of the form 



and is isomorphic to 取 2 . That of I 2 consists of all elements of OT 3 of the fonr 



and is isomorphic to that subgroup of 9^2 consisting of the elements 


f 。 t 

\ esl I 

f 1 

) 6 职 2 ， where hsO > 

\ h i 


’ is 1 J 


Since e and i are both odd, (5 contains no element of order 3, and hence 
isomorphic to OT 2 . But then Ji —» J 2 is impossible. 
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so that 


Since 



we may assume (after a further inner automorphism, if necessary) that Ii, M and N 


are all invariant under the automorphism (but (8) need not hold). 

Thus, after a suitably chosen inner automorphism, we have Ij , M and N invari¬ 
ant. Therefore there exist A, B and C € OT 2 such that 




I emt- 


±AXA~ l 

W 


n 

±1 



and n = (0,0/. Here, the +1 on the left goes with the +1 on the right always (and 
the — l’s go together); further, when X is an even element of t the plus sign occurs 
before AXA~ l , BXB~ l and CXC~ X , while if the minus sign occurs before one of 
these for any odd X G j it occurs there for every odd X G SJlJ • 

Now we may assume that at most one of A, B and C has determinant —1; for if 
both A and B (say) have determinant —1, apply a further inner automorphism (with 
factor N) which leaves I\, M and N invariant and changes the signs of det A and det 
B. Suppose hereafter, without loss of generality, that det 乂 = detB = 1. 

Next, N is invariant, but by (9) goes into 
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so that 



This gives two possibilities: 



The same holds true for B (but not necessarily for C, since det C = ±1). 
Suppose firstly that either j 4 or B is say A = J ⑵. Then 



Case 1 T invariant. Then 


f 0 10 、 

(010 、 

-10 0 

and 1 1 0 0 

V 0 0 1 J 

( 0 0 -1 J 


are both invariant (the first matrix is invariant in virtue of the remarks after (9); the 
second is invariant because it is M times the first). For either possible choice of B 
we find that 



is mapped into 




if + is used, 
if — is used, 
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where V — IiUI^ 1 Thus, in this case, T —* T = IiTI^ 1 , and either U —* U ox 
U —» IiUIi 1 . Since T and U generate ① QJig , the automorphism is inner. 


Case 2 



0 


0 

0 


Then 



If we set Z = TU 2 , then 

(10 0、 

110 =(UZ~ l ) 2 UZ 2 . (10) 

V° 0 w 

Now certainly the left side of (10) maps into 



whereas, knowing T T and U T , we can compute Z T and thence can find the image of 
the right side of (10). We readily find (for either value of U T ) that the right side of 


(10) maps into 

( 1 

3 

\ ' 

and hence we have a contradiction. 

Therefore Case 2 cannot occur, and so if either Aot B equals /( 2 ) ， the automor¬ 



phism is inner. Suppose hereafter that 



① Hua L K and Reiner I, loc. cit. 
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Proof By Corollary 1, J{ € 2Jl~, and JJ is an involution. After a suitable 
inner automorphism, we may assume that J{ = W(x,y,z) (as defined by (4)), where 
2x + y + z = n and x + y is odd. Every element of 9JT„ which commutes with J\ 
maps into an element of 9Jl„ which commutes with W. Every matrix in maps 
into a matrix in SDl^. Combining these facts, we see that the group 0i consisting of 
those elements of OTJ which commute with J\ is isomorphic to 02 , the corresponding 
group for W. If we prove that this can happen only for a: = 0, j/ = 1, z = n — \ or 
a ： = 0, 3 / = n — 1 , z = 1 , the result will follow. 

The group consists of the matrices in 971J of the form (±1) -j-Xi, Xr e OTln-i, 
and so clearly ( 8 i S 田 ln-i. 

The group 02 is easily found to consist of all matrices C G £Dtf of the form (we 
illustrate the case where x = 2 ): 



For the moment put 



{x terms) 
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0 


a 3 

0 

0 

KCK 1 = • 


0 


m 


0 ai 

di 0 

0 a<t 

d 3 0 

-2ei 0 

— 0 

0 0 \ 


0 0 - 0 2/?i ••- 2P Z 

d2 ai ■■- a v 0 … 0 

0 0 ••- 0 2<5i - 2S Z 

d，n 7i ••- 7y 0 … 0 

-2Ci 

U 0 

— 2 C» 

0 

0 V 


V vz o e z o 

and so C is similar to 


/ 


〈 ai 

02 2/3i ••• 

2/3,) 


f dy 

di a! 

… a y > 

03 

04 2^1 --. 

2S Z 


^3 

d* 7i 

… lv 

m 

Oi 


+ 

- 2 ei 

- 2 C! 



: V 





U 

\ Vz 

e z 

) 


〈 一 2e « 

— 2 Cy 

/ 

卜 

2 R\ \ a:. 

(s 2 

Q2 





Ti ) z 

V 2 R 2 

r 2 

)y' 



X 

z 

X 

V 





with a fixed similarity factor depending only on W. Therefore 02 — ®i where (8 = 
<3(x,y,z) is the group of matrices in of the form 



2 Ri ) 

l 1 J. 

(知 

Qi \ 


) 

N + 

V 2 R 2 

^ ) 

X 

z 


X 

y 


where = S 2 (mod 2). Here 2 x + y + z = n and x + y is odd. 

We wish to prove that SDln-i - &(x,y,z) only when i = 0 , y = 1 , « = n - 1 or 
x = 0, y = n - 1, z = l. In order to establish this, we shall prove that in all other 
cases the number of involutions in @ which are nonsimilar in )8 is greater than the 


number of involutions in 9JTn—i which are nonsimilar in Win—i； this latter number is, 
of course, A n -i (given by ( 6 )). 

We shall briefly denote the elements of 0 by A + B, where 


(Si 2Ri \ 
V Qi Ti ) 


and 


B 


St 

2R 2 


Qi 

T 2 
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If Ai -i- Bi and A 2 + B 2 are two involutions in 0, where either 

in M x+Z or 

B x ^B 2 

in Sffla+j,, then certiunly 

Ai -}- B\ ^ A2 + B2 

in © (these may be similar in ®ln, however). Therefore, the matrices A + B, where 
A -/ (oi) + + L+- + L, 

(Ci terms) 

B = I^ + (-I) iba) + L + ••• + L, 

(C2 terms) 

obtained by taking different sets of values of (ai, , cj, 02,621 C 2 ), if they lie in (5, are 
certainly nonsimilar in (5. Here we have 

ai + 61 + 2 ci = i + z, a 2 + 62 + 2 c 2 = 1 + y, 61 H- 62 + ci + C 2 even. 

If x 0, we impose the further restriction that ci < (2 + 1)/2, eg < (y+ 1)/2, and 
that in B instead of L we use V . These conditions will insure that A + B € <S. We 
certainly do not (in general) get all of the nonsimilar involutions of (8 in this way, but 
instead we obtain only a subset thereof. Call the number of such matrices N. 

For i = 0, we have N = B v B t + {A y — B V )(A Z — B z ). Since y is odd, A v = 2 B y , 
and therefore 

N = ByA z = ByA n -y. 

Case 1 n even. Then N ~ (y + l)(y + 3)(n — y+ l)(fi — y + 3)/32. If neither 
y nor n — y is 1 (certainly neither can be zero), then 

(y +l)(n-y +1) ^ 4(n-2) and (y + 3)(n - y + 3) > 6 n, 

so that 

(24/32)n(n - 2). 

For n = 4, x = 0, either y = 1 or 2 = 1. For n ^ 6 , we have N > A n —i. Hence in this 
case 0 is not isomorphic to OT„_j (if either y or n - y = 1, then W{x,y,z) = ±Ji). 

Case 2 n odd. Then N = (y+ l)(y + 3)(n -y + 2) 2 /32. We find again that 
N > A n - 1 for n > 5. 

This settles the cases where 1 = 0. Suppose that 1 一 0 hereafter. Then N is the 
number of solutions of 
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ai + 61 + 2 ci = a: + z, 02 + i >2 + 2 c 2 = a: + y, fei + 62 + cv + C 2 even, 

… “宇， 0 …中. 

Using [r] to denote the greatest integer less than or equal to r, we readily find that 
N is given by 


'z + j 
~~2 


1 fy+ 31 ( ， 






By considering separately the cases where y and z are both even, one even and one 
odd, and so on, it is easy to prove that N ^ An-i in all cases except when both y and z 
are zero. Leaving aside this case for the moment, consider the matrix Ao + e ©, 
where Aq G ^Jt x +z is given by 


M = 


12 2 
0-10 
0 0-1 


,00 0 

The matrix Ao + J"(* +v ) is certainly an involution in ©. Since, in 9n i+ *, 

(1 0 … 0 、 

0 -1 … 0 


Aq = 


=j4i, 


Ao + I^ x+ v) can be similar (in 0) only to that matrix (counted in the N matrices) of 
the form Ai + /(*+**)• But from 


/ ai a 2 ••- a x 2bi ••- 26* \ / 01 ■■- a x 2b\ ■■- 2b z 



ai = a2 = ••■ = Ox = 26j, 


we obtain 
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the plus signs go together, as do the minus signs. Furthermore, by our induction 
hypothesis, 

X r = ±AX*A~ l , 


where A € and either X* = X for all X e or X" - X ,_l for all 
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The minus sign here is impossible by Lemma 2, since n ^ 4. Hence J\J n is invariant, 
and therefore so is J n . By the same reasoning all of the J v [y = 1, • • • , n) are invariant. 
From the above remarks we see that for X 6 WlJ-p 


( 1 n， V = ( 1 n, \ ( X « V = / AnX^A - 1 n 

\ n X / ~\n AiX^A ^ 1 ) ' ' \ n' 1 ) n' 1 


where A v 6 and in fact Ai = I. Now suppose that Z 6 £DT^_ 2 , and form 

/( 2 ) 2. Since it commutes with both J\ and J- 2 , its image must do likewise. But 


then 



for every Z G £0l^_ 2 . Setting 



we obtain fZ = y 7 , 5 = Zt). Since this holds for all Z € 义 _ 2 , we must have 
p = 5 = n, so that Ai is itself decomposable. A similar argument (considering 
the matrices commuting with both J\ and J u , for v = 3, ••- ,n) shows that Ai is 
diagonal. Correspondingly, all of the A u are diagonal. It is further clear that all of 


inner automorphism factor D -1 , we may henceforth assume th 
decomposable X e where either X* = X always or X* : 
tOtn is generated by the set of decomposable elements of SJtJ, t 


Automorphisms of the projective unimodular 
group* 


Notation Let fJJln denote the group of n x n integral matrices of determi¬ 
nant ±1 (the unimodular group). By 3HJ we denote that subset of Win where the 
determinant is +1; is correspondingly defined. Let ^p 2 n be obtained from 97l2n 
by identifying +X and —X, X 6 3Jl2 n (this is the same as considering the factor 
group of OT 2 r» by its centrum). We correspondingly obtain and ?PJ n from 9Jl^ n 
and Let /( n ) (or briefly I) be the identity matrix in fJJln, and let X' denote the 
transpose of X. The direct sum of A and B is represented by /! + B, while 

A = B 

means that A is similax to B . 

In this paper we shall find explicitly the generators of the group ® 2 »» of all auto¬ 
morphisms of 屯 2 n, thereby obtaining a complete description of these automorphisms. 
This generalizes the result due to Schreier ① for the case r» = 1. 

We shall frequently refer to results of an earlier paper: Automorphisms of the 
unimodular group (L. K. Hua and I. Reiner. Trans. Amer. Math. Soc., 1951, 71: 
331-348). We designate this paper by AUT. 

1. The commutator subgroup of 

The following useful result is an immediate consequence of the corresponding 
theorem for 97l2 n (AUT, Theorem 1). 

Theorem 1 Let be the commutator subgroup of ^p 2 n- Then clearly © 2 r» C 
^ n . For n = 1, © 2 n is of index 2 in while for n> 1, © 2 n = 屯 

Theorem 2 In any automorphism of^ 2 n, always ^ e if- 

Proof This is a corollary to Theorem 1 when n > 1, since the commutator 
subgroup goes into itself under any automorphism. For n = 1, suppose that ±S 
±Si and ±T —> ±Ti, where 

* Received by the editors May 18, 1951. Reprinted from Transactions of the American 
Mathematical Society, 1952, T2(3): 467-473. By Hua L K and Reiner I. 

① Abh. Math. Sem. Hamburgischen Univ., 1924, 3: 167. 
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( 1 ) 


Since S and T generate SDlJ, it follows that 士 S and ±T generate 屯 and hence so 
must ±5i and ±2\. It is therefore sufficient to prove that det Si = det T\ = +1. From 
(ST ) 3 = / we deduce SiTi = ±Tf so that det SiTi = 1. Hence either 

Si and Ti are both in ^ or both in 屯 J; we shall show that the latter alternative is 
impossible. 

Suppose that det Si = det Ti = —1. FVom 5 s = / we deduce = ±J; if 
— —I, then 5? + / = 0 and the characteristic equation of 5i is A 2 + 1 = 0, from 
which it follows that det Si = 1 ; this contradicts our assumption that det 5i = — 1 ， 
so of necessity 5? = I. But if this is the case, then it is easy to show that there exists 
a matrix A 6 OTI 2 such that ASiA -1 takes one of the two canonical forms 

{i -0 ^ 

By considering instead of the original automorphism r, a new automorphism t' defined 
by: X r， = AX r A~ 1 , we may hereafter assume that 



Let 

T 1 = ± 




then ad —be = — 1 . 

Now we observe that J = (1) + (—1) is distinct from ±/ and ±S, that it com¬ 
mutes with 5, and that JT is an involution. Hence there exists a matrix M 6 ^2 
distinct from 士 / and ±5i，such that A/commutes with Si, and MT\ is an involution. 
Case 1 

Si = 士 

Since (S 1 T 1) 3 = ±/, we find that a — d = 士 1. The only matrices commuting with Si 
which are distinct from 土 / and ±5i are 
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If M is either of the first two matrices, then the condition that MT\ be an involution 
yields 6 + c = 0. Thus a = d ± 1 ， 6 = —c, and ad — be = —1. Combining these, we 
obtain d{d ± 1) + c 2 = —1, which is impossible. The other two choices for M imply 
6 = c, and therefore d{d ± 1) — c 2 = —1. Hence 1 — 4(1 — c 2 ) is a perfect square; but 
4c 2 — 3 — f 2 implies (2c + /)(2c — /) = 1, whence c = ±1. But then ad = 0; from 
a — d = 士 1 we deduce that a 2 — d 2 = ± 1 , whence (SiT^) 3 = ±7, which is impossible. 

Case 2 



From (S^) 3 = ±1 we obtain a — d + b = ±1. For M there are the four 
possibilities 



Since MT\ is an involution, in the first two cases we have a — 2c — d = 0, whence 
ad — be = {(a + d) 2 + (a — d ± l) 2 — 1}/4 一 —1. 

In the second two cases we find that a — 2c + 6 — d = 0, so that 2c = a + b — d = ±1, 
which is again a contradiction. This completes the proof of Theorem 2. 


2. Automorphisms of 

Let us now determine all automorphisms of 屯 2 . Since every such automorphism 
takes into itself, we begin by considering all automorphisms of fpj • 

Theorem 3 Every automorphism o/ 屯 } is o/ f/ie form X G ?P 2 ~* AXA~ x 
for some A € 9512 ； that is, all automorphisms of^ are "inner" (with A 6 SDI 2 rather 
than A € 

Proof Let r be any automorphism of 时 ， and define S 1 and T as before; let 
So G SW 2 be a fixed representative of 土 5^ By Theorem 2, So G and therefore 
Sq = —I. Let To be that representative of ±T T for which (S。! 1 。) 3 = / is valid. Then 
S —* S。, T To induces a mapping from SUlJ onto itself. The mapping is one-to-one, 
for although an element of can be expressed in many different ways as a product 
of powers of S and T, these expressions can be gotten from one another by use of 
5 2 = —I, (ST) 3 = /; since So and To satisfy these same relations, the mapping is 
one-to-one. It is an automorphism because r is one. Therefore (AUT, Theorem 2) 
there exists an A € OTI 2 such that So = 士 ASj4 _1 ， IT。= This proves the 

result. 
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Corollary 













'； iv 


■ 




Remark For n = 1, the automorphism (ii) is a special case of ( 1 ). 

In. the proof of Theorem 4 by induction on n, the following lemma (which has 
already been established for n = 1 ) will be basic: 

Lemma 1 Let J\ = (-1) +/ (2n-1) . In any automorphism r o/^n, = 

±AJ\A~ l for some A G 9H2n- 

Proof The result is already known for n = 1. Hereafter let n > 2. Certainly 
(Jf ) 2 = ±1 and det = -1. If (Jf ) 2 = -I, then the minimum function of 
J\ is A 2 + 1, and its characteristic function must be some power of A 2 + 1, whence 
det J\ = 1. Therefore {J{) 2 = /is valid in £0l 2 n- After a suitable inner automorphism, 
we may assume that 

Jl = W(x, y,z) = L + ■■■+L + (-I)^ + & ) , 

where 

L 

occurs x times, 2x + y + z = 2n, and x + y is odd (this follows from AUT, Lemma 1). 

Let 0i be the group consisting of all elements of 屯 2 n which commute with Ji, 
anH (Ri t.hfi rnrresnontline erouD for JT. The lemma will be proved if we can show that 
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Those elements ±C G 平 2 n which commute with W must satisfy one of the two 
equations: CW = WC or CW — -WC. The solutions of the first of these equations 
form a subgroup of 02 , and this subgroup is known (see AUT, proof of Lemma 2) to 
be isomorphic to 0 o = <&o(.x,y,z) consisting of all matrices in 屯 2 n of the form 


(Si 

2 R 1 \ 

v ( 

Qi \ 

\Q^ 


V 2i?2 

r 2 J 


where Si, S^, T\, and Ti are square matrices of dimensions x, x, z, and y respectively, 
and where Si = Si (mod 2), 2x + y + z = 2n, and x + y and x + z are both odd. 

Next we prove that CW = —WC is solvable only when y = z. The space U of 
vectors u such that Wu = u is of dimension x + z, while the space © of vectors b for 
which Wb = -b has dimension x + y. But if CW = —WC, then WC\x = —Cu and 
WC^b = C~ l b, so the dimensions of 11 and © must be the same, whence y = z. 
Hence ifyj^z, there are no solutions of CW = —WC, C € 

We may now proceed to find a lower bound for the number of nonsimilar matrices 
in <So{x,y,z). We briefly denote the elements of 0o by i4 -f B, where 


(Si 2R X \ 


and 



Q2 

t 2 


If Ai -j- B\ and A<x 4- are two distinct involutions in 0。， where either 
A\ ^ A 2 in M x+Z or Bi ^ in M x+V , 


then certainly 


Ai + Bi ^ A2 + B2 in (Bq- 


Now let 

A = /<° l > + (-/) (6l > + L+- - +L, 

B = I^ aa) + (-I) {h) + L+ ■•■+L, 

where L occurs ci times in A and c 2 times in B\ the various elements A-\-B gotten 
by taking different sets of values of (ai, 6 i, 01 , 02 , 62 . ca)i if they lie in 0 o, are certainly 

nonsimilar in ® 0 , except that A + B and {-A) + (-B) are the same element of 0 O . 

Hence the number N of nonisimilar involutions of 0o is at least half of the number 
Ni of solutions of 

Oi + 61 + 2 ci = x + z, 


a 2 + 62 + 2 ca = x + y, 
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where if s 0 we impose the restrictions that cx ^ (z + 1 )/ 2 , C 2 彡 (y + 1 )/ 2 , and 
that in B instead of L we use L' (these conditions insure that A + B 6 0o)- As in 
the previous paper, one readily shows that N > n(n + 1) unless = ±Ji. We omit 
the details. 

This leaves only the case where y = z.li CW — —WC, then C k W = (— l) k WC k ; 
therefore no odd power of C can be ±/. Let p be a prime such that n < p < 2n. 
Since x + y = n, certainly n is odd, and p > n + 2. Now (being isomorphic to 
2 J? 2 n-i) contains infinitely many elements of order p. However, 02 contains only two 
such elements, since CP ^ by the above argument, while if (7 € ®o and C p = ±/, 
then setting C = + fl( n ) shows that A p = ±1 and B p = ±1. However, A 6 OTl n , 

and if A p = ±1, then the minimum function of A must divide 干 1. But the degree 
of the minimum function is at most n, and therefore is less than p — 1 , whereas 
is the product of a linear factor 入干 1 and an irreducible factor of degree p— 1 ; thence 
the minitnnni function ofAisA 干 l ， so_<4 = ±/. In the same way B = ±/. Hence the 
only solutions are C = / (n) + / (n) and C = - J (n) + / (n) . This completes the proof of 
the lemma. We remark that the use of the existence of the prime p could have been 
avoided, but the proof is much quicker this way. 


4. Proof of the main theorem 

We are now ready to prove Theorem 4 by induction on n. Hereafter, let n ^ 2 
and assume that Theorem 4 holds for n-1. Let r be any automorphism of ^ 2 n ； then 
by Lemma 1 ， •/ 『 = 士 ■AAA -1 for some A e OT 2n . If we change r by a suitable inner 
automorphism, we may assume that JJ = ±Ji. 

Therefore, every M € 屯 2 « which commutes with J\ goes into another such 
element, that is, 



where n denotes a column vector all of whose components are zero, and X G 9 Jl 2 n-i. 
Thus, r induces an automorphism on 9 Tl 2 n-i. Consequently (AUT, Theorem 4) there 
exists a matrix A e M-in-i such that Y = AX*A~ 1 for all X € Tt 2n -i, where either 
X* = X for all X € SW 2 „-i or X* = X'~ l for all X e After a further inner 

automorphism by a factor of ( 1 ) +A -1 , we may assume that = ±J\ and also that 

= y = X* for all X € 

Let J v be obtained from / '( 2n ) by replacing the t*th diagonal element by —1. 














■376 - 


华罗庚文集代数卷 II 


Then 


/ 1 0 … 0 0 \ 

0 -1 … 0 0 

W^2n) T = ± i : :: 

0 0 … -1 0 

〈0 0 … 0 1 ) 

so that 土 J^n is invariant. Similarly, all 
invariant. Therefore for any X 6 VJl 2 n -i w 


1 

n' 

\ 


'-1 ••- 0 0 \ 


n 

0 … -1 0 


V 

i 0 .•- 0 1 j 

/ 


= ±JiJ2 n , 

f the matrices ±(y = 1,." ,2n) are 
have 




n ; 

VAr 1 






with A u € OT 2 n-i, and in fact A\ = I. 

Now suppose that Z € 97l2n-2> and consider 土 (Z + /( 2 )); since it commutes with 
J 2 n-i and so does its image. But therefore 


^2n 



where 2 denotes some matrix in ®l 2 n- 2 . FVom this one easily deduces that ^ 2 n 
must be of the form B + (1), with B G 97l2n-2- By considering the matrices com¬ 
muting with J v and Jin for v = 1, • • • , 2n - 2 we see that A 2 n must be diagonal. 
Furthermore, it is clear that all of the A v [y = 1 ， ... ,2n) must be diagonal, and all 
are sections of one diagonal matrix i?( 2n ). Using the further inner automorphism 
factor D~ l , we find that ±X T = ±X* for every decomposable matrix ±X € 屯 2 n. 
Since <p 2 n is generated by the set of its decomposable matrices, the theorem is proved. 
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